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Abstract 

The unification problem in algebras capable of describing sets has been tackled, directly 
or indirectly, by many researchers and it finds important applications in various research 
areas — e.g., deductive databases, theorem proving, static analysis, rapid software proto- 
typing. The various solutions proposed are spread across a large literature. In this paper 
we provide a uniform presentation of unification of sets, formalizing it at the level of set 
theory. We address the problem of deciding existence of solutions at an abstract level. This 
provides also the ability to classify different types of set unification problems. Unification 
algorithms are uniformly proposed to solve the unification problem in each of such classes. 

The algorithms presented are partly drawn from the literature — and properly revisited 
and analyzed — and partly novel proposals. In particular, we present a new goal-driven 
algorithm for general ACIl unification and a new simpler algorithm for general {Ab){C£) 
unification. 



KEYWORDS: Unification Theory, Set Theory, ACIl Unification. 



* A. Dovier is partially supported by MIUR project Sybilla, and by GNCS 2005 project on 

constraints and their applications, 
t E. Pontelli is partially supported by NSF Grants CNS-0220590, CNS-0454066, and HRD- 

0420407. 

I G. Rossi is partially supported by MIUR project AIDA, and by GNCS 2005 project on con- 
straints and their applications. 



2 



Agostino Dovier, Enrico Pontelli, and Gianfranco Rossi 



1 Introduction 

Sets are familiar mathematical objects, and they are often used as an high-level 
abstraction to represent complex data structures, whenever the order and repe- 
titions of elements are immaterial. A key operation when dealing with set data 
structures is comparing two sets. According to the traditional extensionality ax- 
iom l|Kunen 1980|l . two sets are equal if and only if they contain the same elements. 
The problem of set equality is usually formally addressed within first-order logic. In 
this context, a set is represented by a first-order term, called a set term, built from 
symbols of a suitable alphabet, using selected function symbols as set constructors. 
Since, in general, variables can occur within a set term in place of either individuals 
or sets, solving equations between set terms amounts to solving a set uniRcation 
or a set matching problem. Intuitively, the set unification problem is the problem 
of computing (or simply testing the existence of) an assignment of values to the 
variables occurring in two set terms which makes them denote the same set. Set 
matching can be seen as a special case of set unification, where variables are al- 
lowed to occur in only one of the two set terms which are compared. Set unification 
can be thought of as an instance of i?-unification IjSiekmann 1989|l . i.e., unification 
modulo an equational theory E, where E describes the (semantic) properties of the 
interpreted symbols used to represent sets. 

Two main approaches for representing sets as terms have been presented in the 
literature. The union-based representation makes use of the union operator (U) to 
construct sets, while the list-like representation builds sets using an element inser- 
tion constructor (typically denoted by {• | •}). The list-like representation has been 
frequently used in the context of logic languages embedding sets. It is used for in- 
stance in dKuper 1990| ), in ( |Jayaraman 1992| ), in IjBeeri et al. 1991|l — where {■ | •} 
is called scons — in the language {log} l|Dovier et al. 1996| . and in the Godcl lan- 
guage ( |Hill and Lloyd 1994| ). In various papers dealing with computable set theory, 
{• I •} is used and called with llCantone et al. 20T)T)l . 

The union-based representation, on the contrary, has been often used when deal- 
ing with the problem of set unification on its own UBiittner 1986l|I7ivesey and Siekmann 1976| ), 
where set unification is dealt with as an Associative-Commutative-Idempotent (ACI ) 
unification problem — i.e., unification in presence of operators satisfying the Associa- 
tivity, Commutativity, and Idempotence properties. In ( |Legeard and Legros 1991D 
sets are represented using the union-based approach; however, since set operations 
are evaluated only when applied to ground sets, set unification is not required at 
all. 

The computational complexity properties of the set unification and set matching 
problems have been investigated by Kapur and Narendran | |Kapur and Narendran 1986| 
|Kapur and Narendran 1992| ), who established that these decision problems are NP- 
complete. Complexity of the set unification/matching operation, however, depends 
on which forms of set terms (e.g., flat or nested sets, with zero, one, or more set 
variables) are allowed. The form of set terms in turn is influenced by the set con- 
structors used to build them. Thus, different complexity results can be obtained for 
different classes of set terms. 
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In this paper we present a uniform survey of the problem of unification in presence 
of sets, across different set representations and different admissible classes of set 
terms. We provide a uniform presentation of a number of different approaches and 
compare them. Unification algorithms for each considered unification problem are 
presented and analyzed. These algorithms are either drawn from the literature or 
they represent novel solutions proposed by the authors. In particular a goal-driven 
algorithm for general ACI unification is proposed, together with a new algorithm 
(with a simple termination proof) for general {Ab){C£) unification. 

1 . 1 Application Domains of Set Unification 

Various forms of set unification have been proposed by many authors, in different 
application frameworks: 

Declarative programming languages with sets: Various declarative program- 
ming languages relying on sets as Brst-class objects have been proposed, which 
provide different forms of set unification. Most of these languages are instances of 
the Constraint Logic Programming paradigm fPo vier et al. 1996 ^'Dovier et al. 20001 
lYakhno and Petrov 2000jl or of the Functional-Logic paradigm fjay araman 1992, 
[Arenas-Sanchez and Rodriguez- Artalejo 2001) ). The specification language Z ( |Spivey 1992| ) 
makes use of sets as data abstraction; attempts have been made to produce ex- 
ecutable versions of Z, such as the ZAP compiler ( [Grieskamp 1999| ) (whose im- 
plementation, however, does not embed a set unification algorithm). 

Deductive databases: Various proposals have been put forward for embedding 
sets as primitive data structures in deductive database languages, providing set 
unification or set matching as a built-in mechanism for set manipulation (jLiu 19981 
lAbiteboul and Grumbach. 1991irNaqvi and Tsur 1989|[Shmueh et al. 1992l|Lim and Ng 1997| 
IKifer and Lausen 1989|l . In these frameworks, it is common to deal with sets in- 
volving unions of variables. 

Al and Automated deduction: Set abstraction and operations have been shown 
to be fundamental in various subfields of Artificial Intelligence. They have been 
used as tools for the description of linguistic theories in Natural Language Pro- 
cessing IjManandhar 1994)l . In particular, unification based grammars augmented 
with set descriptions (e.g., URounds 1988IIPollard and Moshier'1990^1 ') require set 
unification. Set unification has been used in discovery procedures for determin- 
ing categorial grammars from linguistic data (e.g., UMarciniec 1997|l '). Set data 
structures have also been used in pattern matching and pattern directed invo- 
cation in various Al languages ( [Livesey and Siekmann 1976| ). Proposals dealing 
with computable properties and algorithmic manipulation of set structures have 
appeared also in the area of automated deduction, e.g., to reduce the length of 
proofs l|Policriti and Schwartz 1997jl . 

Program analysis and Security: Codish and Lagoon ( |Codish and Lagoon 2000'| ) 
described an application of elementary ACIl unification to the problem of sharing 
analysis of logic programs. Wang et al. show how a system based on CLP [SET) 
(hence, on set unification) can be used to model access control (|Wang et al. 2004|). 
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1.2 Unification Algorithms 

The problem of solving set unification has been mostly tackled in the form of ACI 
unification, and unification algorithms, returning the set of all the solutions to a 
given problem, have been proposed. The first work proposing a viable solution to 
ACI unification is ( |Livesey and Siekmann 19 76). This work mostly deals with AC 
unification — by reducing it to the solution of Diophantine equations — and only in 
the end it suggests a solution of the A CI problem, by replacing arithmetic equations 
with Boolean equations. Direct solutions of the ACI problem have been proposed 
by Biittner (|Biittner 1 9861 and Baader and Biittner l|Baader and Biittner 1988)l . 
More recently, Baader and Schulz IjBaader and Schulz 1996|l provided a general 
methodology allowing the unification with constants algorithms proposed for A CI 
to be extended to general ACIl unification algorithms. 

In recent years, a number of efforts have emerged that propose set unification 
algorithms for the list-like representation of sets, hence for a different equational the- 
ory (called {Ab){C£) in IjDovier et al. 1996| ). A first proposal in this direction is the 
algorithm sketched by Jayaraman and Plaisted in ( |Jayaraman and Plaisted 1989| ). 
A more general and complete algorithm is the one in IDovier et al. 199611 . The prob- 
lem of set unification in this context has been tackled by different authors IjArenas-Sanchez and Dovier 19971 
IDovier et al. 1998l|Storzenburg 1996||Stolzenburg 1999|IDantsin and Voronkov 1999"|l . 
In particular, the algorithms presented in IjArenas-Sanchez and Dovier 1997jl and ( |Stolzenburg 1999| ) 
provide solutions which are optimal, in terms of number of unifiers, for large classes 

of unification problems. The algorithms in IjArenas-Sanchez and Dovier 1997llDantsin and Voronkov 1999|l 
ensure polynomial time complexity in each non-deterministic branch of the compu- 
tation. 

Various authors have considered simplified versions of the (Ab){C£) problem 
obtained by imposing restrictions on the form of the set terms. In particular, 
various works have been proposed to study the simpler cases of matching (e.g., 
UShmueli et al. 1992|l ') and unification of Bound Simple set terms, i.e., bound set 
terms of the form { Si , . . . , s„ } , where each Si is either a constant or a variable IjArni et al. 19921 
lArni et al. 1996ir(Treco 1996| . A parallel algorithm for such restricted {Ab){ C£) uni- 
fication has been presented in ( |Lim and Ng 1997| ). Set matching is also discussed 
in dKapur and Narendran 1986| ). 

All these algorithms, however, have been developed in separate contexts, without 
considering any relationship among them. They have never been formally compared 
and related. A contribution of this paper is to provide a uniform presentation of 
the problem, covering most of its different instances, and surveying the different 
solutions developed. 

1.3 Overall Structure of the Paper 

The paper is organized as follows. In Section El we define the universe of sets we 
are dealing with, along with a suitable abstract syntax for representing them and 
a syntactical classification of the set unification problems. In Section |31 we present 
a number of examples of unification problems which provide motivations for set 
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unification. In Section ^ we discuss the complexity of the set unification decision 
problem for each syntactic class of set terms listed in Section |2 In Section [S] we in- 
troduce the basic notions and notation concerning i?-unification and the equational 
theories used in i?-unification with sets. In Section |S1 we describe the problem of 
ACIl unification with constants and its impact on set unification. In Section [T] we 
extend the discussion to the {Ab){C£) unification problem, i.e., the problem of set 
unification in presence of set terms based on the element insertion constructor {• | •}, 
and we present a new algorithm for this case. In Section |H1 we tackle the most gen- 
eral problem of unification of terms containing both ACIl and free (uninterpreted) 
fmiction symbols. A new general ACIl unification algorithm is presented. Some 
related topics are discussed in Section 13 and concluding remarks are presented in 
Sectional In |Appendix A| the proofs of the main results of the paper are reported. 

2 Sets and the Set Unification Problem 

In this section we characterize the universe of sets we deal with, and we discuss some 
well-known operations on sets. Finally, we formally introduce the set unification 
problem. 

2.1 A Universe of Sets 

A set is an arbitrary, unordered, collection of elements. Typically, a set is specified 
either intensionally, by means of a property that characterizes membership to the 
set, or extensionally, by explicit enumeration of all its elements. In this paper we 
restrict our attention to extensional sets. For instance, {a, b, c} is the (extensional) 
set which contains exactly the elements a, b, and c. We denote mathematically the 
fact: "a belongs to the set {a, b, c}" using the membership relation: a £ {o, b, c}. 
We assume the extensionality axiom (|Kunen 19 80) that states that two sets are 
equal if and only if they contain the same elements. Thus, {a, 6, c} is the unique 
set containing exactly a, &, and c. {a, c, b}, {b, a, c}, etc. are alternative ways to 
describe the same set. A particular set is the empty set 0, that contains no elements. 
A set containing only one element is said to be a singleton. If 5 is a set, then we 
will denote with \s\ its cardinality. 

A set is finite if it contains a finite number of elements.^ For instance 0, {0}, and 
{a, 6, c} are finite sets. However, this definition does not remove all possible cases 
leading to infinity. The singleton set {N} is a finite set, but its unique element N is 
an infinite set. A set is said to be hereditarily finite if it is finite and all its elements 
are hereditarily finite. This definition leaves still a further possibility for infinity. Let 
us consider the sets x and y that satisfy the equations x = {0, y}, y = {x}. They are 
hereditarily finite, but they hide an infinite descending chain x3y3x3y3---. 
These sets, where the membership relation is allowed to be not well-founded, are 

^ A precise, formal, characterization of the notion of finiteness is outside the scope of this work. 
For a theoretical analysis of this topic see iTarski 1924J . 
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called non well-founded sets (or hypersets) IjAczel 19881 IBarwise and Moss 1996|l . 
Hypersets are very important in some areas, such as concurrency theory, but they 
are not accepted in traditional set theory, where sets are expected to be well- 
founded. 

Let us focus on hereditarily finite and well-founded sets. We can consider two 
approaches to set theory: 

• pure sets, in which the only entity that docs not contain elements is the empty 
set 0, and 

• sets with individuals, in which there exists a collection U of individuals, where 
each element of U is not a set and does not contain elements. Since the 
elements of U are not sets, we also have that % ^U. 

In the second approach, the extensionality axiom has to be revised for the elements 
of U, since 

(i) two individuals are different even if they contain the same elements (namely, 
none), and 

{a) all the elements in lA are different from 0. 

In this paper we will focus on the approach based on sets with individuals, as it 
generalizes the pure sets approach (by taking U — ^). 

Let us introduce the universe of sets we are interested in (see also IjCantone et al. 20011 
pg. 88)). As usual, the subset relation x C y denotes the formula Vz {z € x ^ z € 
y). If s is a set, with pfi„(s) = {x : x C s A x is finite} we denote the set of all its 
finite subsets. 

Definition 1 

The Universe HF^ of hereditarily finite sets based on U is obtained as follows: 

< HF^+i HF^ U pfi„ (^HF^) 

. MF" = U.^nHF^ 

The sets in HFq contain the finite subsets of the set of individuals: these particular 
sets are called flat sets. The sets introduced in HF^, with i > 0, may contain 
elements that are sets themselves. We refer to such sets as nested sets. For instance, 
\iU — {a, &, c}, then HFJ^ consists of the flat sets: 

07 {a}, {b}, {c}, {a, &}, {a, c}, {6, c}, {a, &, c} 

Some nested sets are the following: 

W,{{a}}, {0,{{a},&},{{{c}}}} 

2.2 Abstract Set Terms 



So far we have represented sets by exploiting the usual intuitive notation based on 
braces and commas. In order to deal with sets as primitive data objects in a first- 
order language, however, we need to precisely represent them as first-order terms 
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of the language. For this reason, one or more function symbols are selected to be 
used as set constructors. Set constructors will allow complex sets to be built from 
simpler ones. Many different approaches are possible. The two approaches that, to 
our knowledge, have received more attention in the literature are the following. 

1. Union-based representation. This solution is based on the use of the union 
constructor U and, possibly, the singleton constructor {•}. sUt represents the 
set which contains the elements of the sets s and t, that is, 

s U t = {x : X E s V X ^ t} , 

while {t} represents the set containing the single element t. With this ap- 
proach, the finite set {io, ■ ■ • , ^n} is represented by a union of singletons: 
{to}U ■ ■ -U {tn}, where to, . . . ,tn are either sets or individuals. The empty set 
is represented by a distinguished constant 0. 

2. List-like representation. An alternative representation of sets is based on the 
element insertion constructor {-I-}. represents the set obtained by 
adding the element t (either a set or an individual) to the set s if it is not yet 
in s, that is 

{t \ s} = {x : X d s W X — t}- 

The empty set is represented by a distinguished constant 0. Thus, the finite 
set {to, . . . , tn} is represented by a sequence of element insertions: 

{to\{---{tn\n---}} 

where to, . . . ,tn are either sets or individuals. 

As far as the syntactic representation of the individuals (i.e., the elements oilA) 
is concerned, we can represent them either 

• as constant symbols different from (simple individual terms) or 

• as terms of the form f{ti, . . . , i„), n > 0, f different from U and {• | •}, and 
ti, . . . ,tn terms {general individual terms) . 

Both the union-based and the list-like representations allow the elements of the sets 
to be either individual terms or other set terms. Individual and set terms can be 
nested at any level. 

Let us observe that the element insertion constructor {• | •} can be represented 
using U, i.e., {s 1 1} =' {s} U t. However, in l|Dovier et al. 2fl00jl it is proved that, 
without singleton sets, the two symbols are not mutually definable, unless we allow 
the use of complex formulae involving universal quantifiers. Observe moreover that 
the U symbol allows one to define set inclusion: x C_ y is equivalent to x U y = y. 

Furthermore, let us observe that the definition of U, being based on membership, 
makes sense on sets, not on individuals. For instance, the union of two individuals 
a and b would be a memberless object. There is no way of stating that a is equal 
or different from a U b without introducing new, non-standard, axiomatizations. 
For this reason, we assume that the U constructor is used only on sets. Similar 
considerations apply to the second argument of the {• | •} operator. 
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For the sake of simplicity, in the rest of the work we will make use of a sim- 
pler and more intuitive abstract syntax to denote sets, disregarding the concrete 
representation used to encode them as terms in the language at hand. 

Definition 2 

An abstract set term is a term of the form 

{Xi, . . .,Xm, ai, . . . , ttn, Si, . . . , Sp} U Yi U ■ ■ ■ LI Yq ni,n,p,q>0 

where Xj, Yj are variables, Ui are individual terms, and Sj are abstract set terms 
distinct from variables. The Yj variables are called the set variables of the abstract 
set term. In particular, 

• when ra — n — p — q = 0, the term is simply written as 0. 

• when m = n = p = and ? = 1, the term is the set variable Fi. 

The size \\s\\ of an abstract set term s is the number of occurrences of symbols in 
s. 

As a notational convention, we will usually use a, 6, c, possibly subscripted, to 
denote individual terms, and r,s,t, possibly subscripted, to denote (abstract) set 
terms or individual terms. Variables are denoted by identifiers with capital letters. 

When q < 1, the abstract set term can be rendered concretely using both 
representations described above. For example, {Xi, X2, a, b, c} U Yi can be seen 
as a shorthand for both the concrete terms {Xi|{X2|{a|{6|{c| Yi}}}}} and 
{Xi} U {X2} U {a} U {b} U {c} U Yi. Conversely, when q> I, the U constructor is 
required; thus, only the union-based representation is feasible. 

When clear from the context we will omit the word "abstract" , referring to ab- 
stract set terms simply as set terms. 

Set terms may contain variables, both as individuals (the variables X,'s) and as 
sets (the variables Yj's). A set term containing variables denotes a possibly infinite 
collections of sets. For instance, the term {a,X, b} denotes all sets containing two 
individuals, a and b, and possibly a third unknown element X.lf X takes the value 
a or 6 then the set will have only 2 elements. Otherwise, e.g., X = c, the set will 
contain three elements. Note that the set terms {a, a, b}, {a, b, a}, {b, a, a, b}, etc. 
are accepted notations for the same set, i.e., the (unique) set containing exactly a 
and b. Note also that variables in set terms could be implicitly forced to assume 
set values using the fact that the U constructor requires two set arguments. Thus, 
for instance, the variable Y in the set term {a, 6} U Y can take only set values. Set 
terms are called non-ground {ground) if they do (do not) contain variables. Finally, 
note that general individual terms can be non-ground. For instance, f{X, Y) is a 
non-ground term, but the fact that the outermost symbol is not a set constructor 
ensures that it is an individual. 

Example 1 

The following are abstract set terms. 

• {1, 2, 3} (m = 0, n = 3, p = 0, g = 0) 

• {0} (m = 0, n = 0, p = 1, g = 0) 



Set Unification 



9 



• {Xi,X2, a, 6, c, rf} U r (m = 2, K = 4,p = 0, g = 1) 

• Y1UY2 {m = 0,n = 0,p = 0,q = 2) 

• {X, a,6,c,{l,2,3},{0}} (m = 1, n = 3, p = 2, g = 0) 

. {Xi,X2,a,/({a,0}),0}U Fi (m = 2, n = 2,p = 1, g = 1) 



2.3 Set Equivalence and Set Unification 

The most natural decision test regarding set terms is testing whether they represent 
the same set or, in the case of non-groundness, testing whether there exists an 
assignment for the variables that forces the two terms to represent the same set. 

Definition 3 

Given two terms s and t, s = t is said to be an equation. A conjunction Si = 
ii A • • • A s„ = tn of equations is said to be a system of equations. Systems of 
equations are also commonly viewed as sets of equations. 

If Xi , . . . , X„ are the variables occurring in a system of equations C, we denote with 
3C the formula 3Xi • • • 3X„ C. The existence of an assignment for the variables in 
s and t that forces the two terms to represent the same set will be denoted by 
HF \=3s = t, formally defined below. 

Before defining the interpretation of ground abstract set terms in HF, we first 
show how individual terms (syntax) can be related to individuals (semantics). Let 
us assume that U is an infinite set of individuals. Simple individual terms denote dis- 
tinct elements of ZY. For the sake of simplicity, in our examples, the individual terms 
a,b,c, . . . will be interpreted as the corresponding individuals a,b,c, . . . of U — we 
use the so-called unique name assumption. General individual terms /(si, . . . , Sm) 
and g{ti, . . . , tn), with / different from g, denote distinct elements of U, different 
from all the individuals associated to the simple individual terms. Each function 
symbol / of arity n is interpreted as a one-to-one function f™ from HF to U. 

Definition 4 

If s = {ai, . . . , Qn, si, . . . , Sp} is a ground set term, then its interpretation in HF, 
denoted by s™, is the following set: 

• if n = and p = then s™ is the empty set 

• otherwise, s™ is the set containing exactly the elements a™, . . . , a™ and 
sr,...,5™, where 

— if flj is a simple individual term, then a™ is simply the corresponding 
individual. 

— if tti is of the form f{ti, . ■ . ,tn) then a™ is the individual associated to 

If s and t are two ground set terms, then HF \= {s = t) if and only if s™ is the 
same set as t™. 

If s and t are two set terms, and Xi, . . . ,X„ are all variables in s and t, then 
HF \= 3s = t a and only if there exists an assignment a of ground set terms to 
Xi, . . . , X„ such that HF |= (s = t)a. 
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Definition 5 

If s and t are two set terms, the set uniRcation decision (SUD) problem is the 
problem of checking whether HF \= 3{s = t). If s and t are ground, the problem is 
also called set equivalence. 

Definition 6 

If s and t are two set terms and Xi, . . . , X„ are the variables occurring in them, the 
set uniBcation solution (SUS) problem is the problem of finding an assignment a of 
sets and/or individuals terms to the variables Xi, . . . , X„, such that HF |= (s = t)a. 

We give a more standard and complete definition of the unification problem in 
Section|Sl Note that, if two general individuals have the same outermost symbols but 
the ordered list of arguments is different, then they denote two distinct individuals 
(e.g., /(a, b) and /(&, a)). However, if the two individual terms contain set terms 
as their arguments, in order to decide whether the individuals are or not the same, 
one needs to compare the sets denoted by the involved set terms. For example, the 
general individual terms /({a, b}, c) and 6, a}, c) denote the same individual 
since {a, 6} denotes the same set as {&, &, a}. 

From a computational point of view, the complexity of the SUD problem depends 
on the syntactic form of the two set terms s and t. As a matter of fact, while 
the set equivalence test of ground set terms denoting flat sets, such as {a, 6, c} 
and {6, c, a}, is rather easy, when the SUD problem deals with nested set terms 
involving variables it becomes NP-complete (see Section f4.4ll . Thus, in order to 
classify the set unification problem, we subdivide set terms in different syntactic 
classes. 

Definition 7 

For m > 0, n > 0,p > 0, g > 0, the class set(m, n,p, q) is the collection of abstract 
set terms of the form: 

{Xi, . . . , ai, . . . , a„/, si, . . . , Sp'} U Y^i U • • • U Y^i 

where < m' < m, < < n, < < p, < g' < g, and Si G set(m, n,|3, q). 

Observe that G set(m, q) for all m, n,p, q. Furthermore, set(77ii, qi) C 

set(m2, ^2, P2, 92) if "ii < and ni < 712 and pi < P2, and qi < q2. Interesting 
special cases can be obtained by setting some of these parameters to 0: 

ground = lJn>o p>o ^^"^(01 P^ 0)= the collection of set terms of the form 

{oi, . . . , a„, si, . . . , Sp}, with simple individual terms and Si ground set terms. 

gflat((7) = Un>o set(0, n, 0, q): the collection of set terms of the form 

{ ai , . . . , a„ } U Yi U ■ • ■ U Yqi , with simple individual terms and Yi variables 
ranging over gflat((7) sets (i.e., sets denoted by gflat(g) set terms) (0 < g' < q). 

flat(g) = Um>o n>o set(m, n, 0, q): the collection of set terms of the form 

{Xi, . . . , Xm, ai, . . . , a„}U YiU- • - U Yqi , with ai and Xi denoting simple individual 
terms, and Yi ranging over flat(g) sets (0 < g' < g). 
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nested((7) = lJm>o n>o p>o set(m, n,p, q): the collection of set terms of the form 
{Xi, . . . , Xm, ti, . . . , tn, si, • ■ ■ , Sp} U Fi U • • • U Yq', with ti general individual 
terms, Si non- variable nested(g) set terms, Xj ranging over general individuals or 
nested (g) sets, and Fj ranging over nested (g) set terms (0 < g' < 5). 

gflat(g) and flat(g) denote flat sots only, while ground and nested(g) account for 
nested sets. For the same q, we have that gflat(g) C i\at{q) C nested((i')- Moreover, 
ground is included in nested(g) (namely, in nested(O)), but it is not included in 
the other classes, since ground accounts also for nested sets. Actually, these classes 
could be further subdivided into finer subclasses. For instance, we could further 
distinguish between ground and non-ground nested set terms, with simple or general 
individuals. However, the four classes we identified above turn out to be sufficient 
for our analysis. 

Observe that, in the concrete representation of sets, the union constructor is not 
required whenever q < 1. For these sets we can use the list-like representation, 
based on the element insertion constructor {• | •}. For q > 1, instead, we need 
the union constructor, and possibly the singleton constructor. For instance, the 
abstract set term X U {Y}\J Z can be immediately encoded using the union-based 
representation while it has no corresponding encoding in the list-like representation. 
These observations will play an important role when we will address the various 
unification problems. 

3 Examples 

This section presents a series of instances of the set unification problem. This allows 
us to give an intuitive idea of the expressive power of the different frameworks 

considered in the rest of the paper. 

Chords: this is the problem of determining whether two sets of musical notes de- 
note the same chord (a chord is a set of at least three notes — i.e., order and 
repetitions do not matter). We can encode the problem as a set unification prob- 
lem between two (flat) ground set terms: 

{c, e,g, 6b} = {g,g, e, 6b, c, e} 

where c, e, g, 6b are constants representing musical notes (i.e., individuals of the 

language). 

Courses covering: this is the problem of verifying whether two teachers are cov- 
ering three courses in their current course assignment. The problem can be en- 
coded as a set unification problem between a gflat(q) set term, composed only of 
variables, for the teachers, and a (flat) ground set term for the courses: 

Teaciieri U Teacher2 = {coursei,course2, courses} 

Note that, in this case, variables range over unions of elements (i.e., subsets of 
coursei U course2 U courses) rather than simply over individuals. Thus we cannot 
use the list-like representation for its concrete rendering. 
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Graph Coloring: Let us consider the graph-coloring problem consisting of the 
undirected graph 

( {Xi,X2,X3, Xi} , {{Xi, X2}, {X2, X3}, {X3, X4}, {X4, Xi}} ) 

and a set of colors 

{red, green, blue} 

This problem can be easily encoded as a single equation between two nested (q) 
(g > 1) set terms in the following way: 

{{Xi, X2}, {X2, X3}, {X3, X4}, {X4, Xi}} U i? = 

{{rerf, green}, {red, blue}, {green, blue}} 

The right-hand side set is used to encode the set of all viable unordered pairs of 
colors, and it can be a ground set term. 

The solution of this equation (see Definition ^ provides a solution of the cor- 
responding graph-coloring problem. A possible solution (actually, the first one 
returned by the CLP (SET) interpreter UDovier et al. 2000|l ') is: 

Xi = red, X2 — green, X3 — red, X4 — blue, R = {{green, blue}} 

Solutions that make use of only two colors are also computed, such as Xi = X3 = 
red, X2 = X4 — green and R = {{red, blue}, {green, blue}}. 
Finite State Automata: Let us consider a deterministic finite state automata on 
the alphabet {0, 1}, containing the set of states Q — {qo, . . . , Qn-i, where qo 
is the initial state and g„ is the unique final state. Qi — {qo, . . . , qn-i} denotes the 
set of non-final states of the automata. We would like to "learn" the structure of 
the automata by looking at positive and negative examples of strings that should 
be either accepted or rejected. This problem can be encoded as follows. The set 
of transitions D is represented by a nested(q) {q > 0) set term whose elements 
are triples {source, symbol, destination) (where (• , • , •) is a ternary free function 
symbol used to build the triples): 

D = {(go, 0, Xo,o), (go, 1, Xoa), . . . , (g„, 0, X„,o), (g™, 1, X„,i)} 

Observe that the destination of each transition is, at this point, unknown. If 
ciQ ■ ■ ■ ttk is a string of length fc -I- 1 that should be accepted, then we need to add 
an equation: 

{(go, oo, Yi),{Yi, ai, Y2), ...,{Yk, at, qn)} U D = D 

that forces the transitions (go, oq, Yi), ( Yi, ai, Y2), • ■ ■ , (Yk, a^, g„) to belong to 
D. Note that the left-hand side of the equation is a nested(g) set term (g > 1). 
Therefore, we can use the concrete list-like representation to encode it, based on 
the element insertion constructor {• | •}, as well as the union-based representation. 
If bo - ■ ■ bh is a string that should not be accepted, then we need to add the 
equations: 

{iqo,bo,Yi),iYi,bi,Y2),...,iYh,bh,Yt+i)}UD ^ D , {Y^+i} U Qi ^ Qi 
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Fig. 1. The DFA computed from examples and counterexamples 

that force the state Yh+i resulting from the execution to be in Qi, and hence 
not a final state. 

For example, if we want a four-state automata that accepts the strings 000 and 
001 and rejects the strings Oil and 10, then we write the system of equations: 



Q = {90, gi, 92, gs}, Qi = ?i, 92}, 

D = {(go,0,Xoo),(go,l,^oi),(gi,0,Xio),(gi,l,Xii), 

(92, 0, X20), (52, 1, X21), (93, 0, X30), (ga, 1, ^31) }, 
{W^3}UQi = Qi,{if2}UQi = Qi, 
{(go,0, Wi),{Wi,l,W2),{W2A, W:i)}\JD^D, 
{iqo,l,K,),iKi,0,K2)}UD = D, 
{(go,0, ri), (n, 0, ^2), (^2, 0, 93)} UD^D, 
{(go, 0, ^1), (^1,0, Z2), {Z2, 1, ?3)} U - 7? 



A possible solution (the first one returned by the CLP{S£T) interpreter) is (see 
also Figure^: 

D = {(go,0, gi), (go, l,?2),(gi,0, 93), (91,1,92), 
(?2,0, go), (92, 1, go), (93,0, 53), (93, 1, 93)} 

Paths and Subgraphs: Let us represent an undirected graph G as the set of 
all its edges, where each edge is represented by the set of its two constituting 
nodes. Let us consider the problem of computing all the subgraphs of G with 
nodes {ci, . . . , c„} such that each subgraph contains at least one path between 
two given nodes, e.g., ci, c„. This problem can be immediately encoded as a set 
unification problem. In fact, all the subgraphs of G are given by the solutions for 
Gi of the equation 



The subgraphs containing the required path can be obtained by adding the equa- 
tions: 



- Gi\J{{ci, ci}, . . . , {c„, c„}}, G3 = {{ci, Xi}, {Xi, X2}, . . . , {X„_i, c„}}UG3 



Observe that Gi is temporarily extended to the new graph G3 by introducing 
artificial loops, thus allowing us to recognize paths of length less than n. Also, 
observe that the equation Q cannot be rendered concretely using the list-like 



Gi U G2 = G 



(1) 
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Fig. 2. Two bisimilar trees obtained from {{0}, {0, {0}}} and {{{0, 0}, 0}, {0}, {0}} 

representation, since its left-hand side set term involves more than one variable 
ranging over set terms (i.e., it belongs to the nested (g) class, q > 2). 

4 The Set Unification Decision Problem and its Complexity 

In this section, we discuss the complexity of the Set Unification Decision problem 
for each one of the syntactic classes of set terms listed in Section |21 

4-1 SUD for the ground Class 

The set equivalence test for two ground abstract set terms s and t can be solved 
in worst-case time 0(||s|| + H^H) (see Definition O . The proof is based on a tree 
representation of a well-founded set and on the existence of a fast algorithm for 
proving graph bisimulation. We first focus on the pure case (without individuals). 

We can use a tree G — {N, E), rooted in h' E N, where is the set of nodes and 
E is the set of edges of G, to represent a pure set. Edges represent memberships, 
namely (m, n) means that m has n as an element, and the nodes in the tree denote 
all the sets that contribute to the construction of the set. A node without outgoing 
edges represents the empty set 0. It is possible to write a procedure that translates a 
ground set term denoting a pure set into a tree in linear time. An example showing 
two trees obtained in this way is shown in Figure |2] From the figure it is possible 
to observe the sets associated to the subtrees. 

According to IjAczel 1988|l . a bisimulation between a graph Gi — {Ni, Ex) and a 
graph G2 = {N2, E2) is a relation b Ni x N2 such that: 

1. Vm e iVi 3v e N2 such that (u b v) and Vu e N2 3u e Ni such that (u b v) 

2. {ui b U2) A {ui, vi) e Ei^ 3v2 e N2{ {vi b V2) A {u2, V2) G E2 ) 

3. (ui b U2) A {u2, V2) e E2^ 3vi e Ni{ {vi b V2) A {ui,vi) £ Ei )• 

We can use the notion of bisimulation on trees. Specifically, given a tree Gi, rooted 
in node vi , and a tree G2 , rooted in node 1^2 , Gi is bisimilar to G2 if and only if there 
exists a bisimulation b between Gi and G2 such that vi b V2- It is simple to verify 
whether the two trees of Figure El are bisimilar. Observe that conditions 13 and 13 
resemble the extensionality axiom ('Section 12. 1(1 — in fact, pure sets are equal if and 
only if their graph representations are bisimilar IjAczel 1988|l . In IjDovier et al. 20d4jl 
it is proved that bisimilarity between acyclic and rooted graphs can be tested in 
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linear time. This result is based on an algorithm that guesses an initial partition 
of the nodes — in particular, all leaves are initially placed in the same class — and 
refines it using a suitable computation strategy. 

As far as sets with individuals are concerned, the situation is similar. Let us 
assume that ai, . . . , a™ are the individuals occurring in the two terms. One can 
obtain the two trees as in the previous case, but adding a label to each node: for 
a set node and i if the node contains the individual . Then one can run the same 
graph algorithm as in the previous case with a single change at the beginning: the 
leaf nodes are split into different classes according to their labels. 

Remark 1 

In the procedure described above, for ground sets with individuals, we need to 
partition leaf nodes according to their labels (individual names) . A similar problem 
will emerge in other procedures presented in the paper, where constants symbols 
and variables must be ordered. If we assume that the input is given as a string 
and the set of constant /variable symbols used is known in advance, then we can 
order them in linear time using radix sort. If we assume that the input terms 
are represented by trees using structure sharing (namely, there are no multiple 
occurrences of nodes representing the same constant/label), we have an implicit 
ordering of constants given by their memory locations. If, otherwise, the input is 
simply a string or a graph without structure sharing, we first need to provide the 
ordering of the symbols used, which requires time 0((||s|| + ||<||) log(||s|| + \\t\\)). 

4.2 SUD for the gflat(g) Class 

Let q be fixed and consider two gflat(g) set terms to be tested: s — {ai, . . . , a„} U 
Yi---U Yq' and t = {h, . . . , bn'} U Zi U ■ ■ ■ U Z^" {q' < q and q" < q). Let: 

Vi = vars{s) \vars{t) Ci — con st s (s) \ const s{t) 

V2 = vars{t) \vars{s) C2 = con st s (t) \ const s{s) (2) 

V3 ~ vars{s) r]vars{t) C3 ~ consts{s) Ci consts{t) 

where vars{a) and consts{a) denote the set of variables and the set of simple 
individual terms occurring in the term a, respectively (see Remark^for a comment 
on the time required to determine these sets). 

If q' = q" = (i.e., vars{s) — vars{t) = 0), then we arc in the ground case 
studied in the previous section. 

If q' and q" are both greater than (i.e., vars{s) ^ and vars{t) ^ 0), 
then s and t are always unifiable: a solution can be obtained by assigning the 
set {fli, . . . , a„, 61 . . . , 6„'} to all the variables in vars{s) U vars{t). 

If exactly one between q' and q" is 0, then we have that: 

• if = 0, then the problem admits a solution if and only if C2 = 0; 

• if q" = 0, then the problem admits a solution if and only if Ci = 0. 

Thus, to solve the SUD problem for gflat(g) set terms we simply need to compute 
the sets Ci and Vi, a task that can be accomplished in time 0(||s|| + \ \t\\)- The 
considerations made in Remark Q apply to this case as well. 
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4.3 SUD for the flat(g) Class 
Let q be fixed and consider two f\at{q) set terms to be tested: 
s = {Xi, . . . , ai, . . . , a„} U Fi U • • • U 

and 

t = {Wi,...,W,n',h,...,bn'}LlZiU---U Zg„ 

{q' < q and q" < q), and let Vi and Ci be the sets defined in formula 
If 771 — m' = we are in the case gflat(g) studied before. If q' and q" are both 

greater than 0, then a trivial solution always exists, as in the gflat(g) case. 

If q' = q" = 0, then we can observe that a necessary condition for the existence 

of a solution is that: 

\Vi\ + \V2\ + \V^\ > |Ci| + |C2|,|Fl| + |V^3| > \C2\,\V2\ + \V3\ > \Ci\ (3) 

Condition is also sufhcient. If holds, then we will be able to construct a 
solution by assigning a different value from C2 to each variable in Vi, a different 
value from Ci to each variable in V2, and by assigning all remaining elements of 
Ci and C2 (if any) to the variables in V3. Condition ^ guarantees that there 
are enough variables in V3. If some variables are not assigned by this algorithm, 
then the solution can be easily completed. For example, when \ Vi\ > IC2I, we 
can complete the solution by assigning any value from C2 or C3 to the remaining 
variables of Vi. 

If exactly one of q' or q" is (without loss of generality, let us assume q" = 0) , 
then we can determine Vi and Ci as in the previous cases, but without considering 
the variables Yi, Zi. The problem admits a solution if and only if | V2 1 + | V3 1 > \Ci\. 

Thus, the SUD problem for flat(g) set terms can be reduced to the problem of 
computing the sets Vi and Ci. This can be done in time 0(||s|| + \ \t\\) (again, 
see Remark 0. As discussed more extensively in Section TTM the class of problems 
flat(O) has been studied in IjArni et al. 1992|lCreco 1996|l . where these set terms are 
called Bound Simple set terms. 

4-4 SUD for the nested{q) Class 

The set unification test for nested sets with non-ground elements (i.e., with general 
individuals) has been proved to be NP-hard in ( |Kapur and Narendran 1986| ) even 
for the simple case of nested (g) with q — 0. We report here the NP-hardness proof 
from l|Dovier et al. 1996jl . Let us consider an instance of 3SAT, e.g.: 

{Xi V V A3) A (-Al V A2 V -A3) 

Checking its satisfiability is equivalent to testing set unification of the two following 
nested (0) set terms: 

{{Al, n}, {X2, Y2}, {X3, ^3}, {Al, Y2,X3, 0}, { Fi, A2, ^3, 0}} and {{0, {0}}} 

where we interpret as false and {0} as true. 

To prove that the test is in NP, instead, one needs to prove that, when it is 
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ground 


gflat(g), g > 


flat(g), g = 0, 1 


SUD Complexity 
Theory 


0(\\s\\ + \\t\\) 
{Ab)(C£) 

flat(g), g > 1 


o(lkll + lli|l) 

ACIl with constants 
nested(g), g = 0, 1 


0(11^11 + 11*11) 
{Ab)iCl) 

nested(g), g > 1 


SUD Complexity 
Theory 


0(\\s\\ + \\t\\) 
gen. ACIl 


NP 

mici) 


NP 
gen. ACIl 



Table 1. Complexity of the SUD problem s = t and i?-theory used to solve the 
SUS problem 

satisfiable, there exists a witness for this situation that can be verified in poly- 
nomial time. Proofs for this result are rather complex and they can be found 
in UKapur and Narendran 1992|lOmodeo and Policriti 1995|) . 

4-5 Summary of Results for the SUD Problem 

Table summarizes the complexity of the SUD problem for the different classes 
of set terms we have introduced. The Theory row will be explained in the next 
sections. 

4-6 Equations vs. Systems 

We have defined the SUD and SUS problems on a single equation. The notions can 
be extended to deal with systems of equations as well: in this case we need to check 
whether all the equations in the system are simultaneously unifiable. 

In the ground case nothing changes: each equation is analyzed independently. For 
gflat(g) we know from (K apui^nd Narendran 1992| .Hermann and Ko la itis 1997|l 
that the ACIl with constants unification problem for systems of equations can be 
reduced to propositional Horn satisfiability and, thus, it is in P. In Section El we 
prove the equivalence of this problem with the gflat(g) unification problem. 

As far as the flat(g) class is concerned, we can adapt the reduction of the 3SAT 
problem done for the nested (g) class, using the constant 1 instead of the set {0}. 
The instance of 3SAT is mapped to the system of equations: 

{Xi, Y,} = {0, 1}, {X2, Y2} = {0, 1}, {X3, ¥3} = {0, 1}, 
{Xi, Y2,X3, 0} = {0, 1}, { Y,, X2, Ys, 0} = {0, 1} 

where all equations involve only flat(g') set terms. Thus, while the flat(g) SUD 
problem for a single equation requires linear time, the same problem for systems of 
equations is NP-complete. 

Regarding the nested((7) class, each system of equations {si = <i, . . . , s„ = 
can be polynomially reduced to an equisatisfiable equation as follows: 

{(1, si), . . . , (n, s„)} = {(1, ti), . . . , (n, tn)} 

where n is a polynomial encoding of the natural number n (e.g., = 0, tt, + 1 — {n}) 
and {x, y) is an encoding of the ordered pair (e.g., {x, y) = {{x}, {x, y}}). Thus, 
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the complexity of the problem on systems of equations is the same as for a single 
equation. 

5 i?-Unification 

i?-unification is concerned with solving term equations modulo an equational theory 
E. Set unification can be seen as an instance of the i?-unification problem, where 
the underlying equational theory contains the identities that capture the properties 
of set terms — i.e., the fact that the ordering and repetitions of elements in a set are 
immaterial. Diff'erent approaches have been considered to encode sets. Accordingly, 
diff'erent choices of E should be considered to describe their basic properties. 

We assume the reader to be familiar with the notions of equational theory, E- 
unification, i?-unifier and related topics (see, e.g., IjSiekmann 1989l|Baader and Snyder 2001| )). 
In this section we introduce a few basic notations concerning i?-unification and set 
unification which will be useful in the rest of the paper. 

A signature E consists of a set of function symbols. Terms built from S and from 
a denumerable set V of variables are called E-terms. T(E,V) is the set of all the 
E-terms — and it is called the term algebra. Given a sequence of terms ii , . . . , f„ , 
vars{ti, . . . ,tn) denotes the set of variables occurring in the terms, vars is naturally 
extended to equations and sets of equations. 

A substitution a : V — > T(E, V) is represented by the notation [Xi/ti, . . . , X„/i„], 
where dom{a) — {Xi,...,X„} (the domain of cr) and, for each i = l,...,n, 
a{Xi) = ti. A substitution is miiquely extended to a function over T'(E,V) us- 
ing structural induction. The application of a substitution to a term t will be 
denoted by ta (or, equivalently, by <j{t)). 

An equational theory is a finite collection of identities E, where each identity 
is written as s ~ t, and s, t are terms belonging to T(E, V). The relation =b is 
the least congruence relation on the term algebra T(E, V), which contains E and 
it is closed under substitution ( |Baader and Snyder 2001| ). Function symbols not 
occurring in E are said to be free. 

We describe now the properties of the function symbols that we use as the set 
constructors. The properties that the U constructor should have in a set theory can 
be described by the following identities: 

(A) {XU Y)U Z w XU(YUZ) (Associativity) 
(C) XUY ^ Y(JX (Commutativity) 

(/) X U X ^ X (Idempotence) 

Moreover, the constant symbol 0, used to denote the empty set, is the identity 
element for the U operator. This is stated by: 

(1) XU0 w X (Identity) 

Let Each be the equational theory consisting of identities (A), (C), (/), and (1). 
The {■ I ■} constructor, instead, should exhibit the properties described by the 
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following identities: 

(Ab) {X\{X\Z}} « {X\Z} (Absorption) 

{C£) {X\{Y\Z}} « {Y\{X\Z}} (Commutativity on the left) 

A substitution a is an E-uniGer (or, simply, a uniRcr when the context is clear) 
of two terms s,t if su =e to — i.e., sa and to belong to the same iJ-congruence 
class. 

An E -unification problem over S is a system of equations £" = {.si = ii, . . . , s„ = 
tn) between S-terms. A substitution jj, which is an iS-unifier of all the equations in 
£ is said to be an £'-unifier (or an E-solution) of £. The set of all the E'-unifiers of 
£ is denoted by Ue{£)- 

Let E be an equational theory and W a set of variables (W C V). Ue{£) can be 
sorted with respect to the pre-order given two substitutions (Ti, fT2: 

ci <^ (^2 iff there exists a substitution A such that 
(72 (X) =E (cti o \){X) for all X in W. 

In this case we say that ci is more general modulo EonW than a2- If <Ji <^ ui 
and (72 <^ (71, then we say that u\ =^ ui. Whenever W is omitted from 
then W is implicitly assumed to be vars{£). 

While traditional syntactic unification problems between Herbrand terms admit 
at most one most general unifier (mgu), i?-unification problems may not have a 
single most general unifier. In this context, the role of the most general unifier is 
taken on by a minimal complete set of unifiers. A complete set of E-uniRcrs for 
an iJ-unification problem f is a set C of iJ-unifiers (i.e., a subset of Ue{£)) that 
satisfies the additional condition: 

• for each E'-unifier a there exists an element ^ in C such that 6 <e cr. 

A complete set of E'-unifiers C is called a minimal complete set of E-unifiers if it 
fulfills the minimality condition: 

• for any pair /xi, /X2 in C, if fii <e M2, then fii = /i2- 

A substitution cr in a minimal complete set of iJ-unifiers C is called a maximal 
general E-unifier. When C is a singleton set {a} we say that a is the most general 
E-uniRer. If one minimal set of i?-unifiers can be obtained from another one by 
variable renaming and vice versa, the two sets are equivalent and only one of them 
needs to be computed. 

A special form of systems of equations, called the solved form, plays an important 
role in the definition of unification algorithms. An equation e of the form X = t is 
said to be in solved form with respect to a system £ if X does not occur neither in 
t nor elsewhere in £. In this case, X is said to be a solved variable in £. A system 
£ is said to be in solved form if, for all e in e is in solved form with respect to 
£. From a system in solved form {Xi = ti, . . . ,Xn = tn}, it is simple to derive the 
most general E'-unifier [Xi/ti, . . . , X„/f„]. 

E'-unification problems can be classified according to whether their signature S 
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contains free elements (i.e., function symbols that do not occur in E). In particular, 
it is possible to distinguish between: 

• elementary unification^ where the terms to be unified are built only using the 
symbols appearing in the considered equational theory; 

• unification with constants, where the terms to be unified are built using sym- 
bols in the equational theory and additional free constants; 

• general unification, where the terms to be unified are arbitrary terms con- 
taining function symbols which are either free or present in the equational 
theory. 

The unification problem studied in the next section falls in the class of unification 
with constants. The remaining sections consider general unification problems. 

The SUD problem studied in Section 21 is an abstract case of the E-unifiability 
problem (namely, deciding whether or not an i?- unifier exists). In the next sections 
we deal with the SUS problem, i.e., the problem of determining a complete set of 
i?-unifiers of an equation s = i or of a system of equations £. 

6 ACIl with Constants 

According to the classification presented in Section [21 the simplest non-ground set 
terms we deal with are those belonging to the gflat(g) class. In this section we show 
that the SUS problem for this class can be solved by using the solution to the ACIl 
with constants unification problem. 

6.1 Language and Semantics 

Let S — {0, U, ci, C2, . . .} be a signature composed of the binary function symbol U, 
the constant symbol 0, and an arbitrary number (possibly infinite) of free constant 
symbols ci , C2 , . . . 

Definition 8 

An ACIl with constants term is either a variable, a constant, or a S-term of the 
form si U S2, where si and S2 are ACIl with constants terms. 

The properties of the function symbols U and are described by the identities [A), 
(C), (/), and (1) introduced in Sectional Thanks to the associativity property [A), 
ACIl with constants terms can be always written as strings of the form aiU- • •Uam 
where ai is cither a variable, 0, or a constant term Moreover, using (C),(/), 
and (1) we can restrict our attention to terms without duplications of sub-terms 
and without as a sub-term (unless the whole term is 0). 

Flat set terms with variable elements (i.e., flat(g') set terms) are not expressible in 
this language. Indeed the language does not allow us to distinguish individuals from 
sets. Variables in a set term are always interpreted as set variables. Furthermore, 
nested set terms are not expressible in this language IjDovier et al. 20fl0| . 
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6.2 Which Kind of Set Unification 

The ACI\ with constants language aUows us to describe the set unification problem 
for gflat(g) set terms. The SUS problem for this class can be solved using the solution 
to the corresponding ACIl with constants unification problem (defined below). As 
an example, let us consider the gflat(g) unification problem: 

{a, &} U Fi U Y2 = {a, h, c, d} 

The solutions for this problem are those mapping Yi and Y2 to subsets of {a, 6, c, d} 
such that c and d are in the image of Yi or Y2- For instance, [yi/{a, c}, Y2/{a, 6, d}] 
is a solution. Let us consider now the related ACIl with constants unification prob- 
lem: 

fflU&UyiUy2 = aU6UcUd 

In this case, a, b, c, d are not interpreted as set elements. However, thanks to 
the properties of the U operator, the solutions for this problem are closely re- 
lated to those for the gflat(g) unification problem. The solutions for the ACIl 
with constants unification problem are those mapping Yi and Y2 to unions of ele- 
ments of {a, 6, c, d} such that c and d are in the image of Yi or F2. For instance, 
[Fi/aUc, y2/aU6Ud]. 

We formalize this idea by defining a function (•)* that translates gflat(g) set terms 
into ACIl with constants terms as follows: 

({ai, . . . , a„} U Fi U ■ • • U y,)* = ai U • • • U a„ U n U ■ • • U 

(•)* admits an inverse function. The function can also be extended to substitutions: 
a*{X) = {a{X)Y. 

Lemma 1 

cr is a solution of the gflat(g') SUS problem s — t \i and only if a* is a ACIl unifier 
of s* = t*. 



For the proof, see [Appendix A| 



Example 2 

The following are set terms and set unification problems which are allowed in ACIl 
with constants: 

• Ai U X2 U X3 = A4 U Xi 

• aU6UXiUA2 = cU X3— that is ({a, 6} U Ai U A2)* = ({c} U A3)* 

• the first problem of Section |31 (the Chords problem) can be encoded as the 
ACIl with constants problem cUeUgUbb — gUgUeUbbUcLle. 



6.3 Unification Algorithm 

A general algorithm capable of computing a minimal complete set of v4C71-unifiers 

for ACIl with constants unification problems has been presented in (|Baader and Biittner 1988)l 

Given two S-terms s and t the algorithm computes a complete set S of ACIl- 
unifiers for s = i. Without loss of generality, we assume that if only one of the 
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terms is ground, then it is t. The set S can be extracted from a schema of Boolean 
^C/-matrices. Each column of the matrix is associated to a variable in s = t. Each 
row, instead, is associated to new variables that will enter in the solutions. The 
matrix is composed of identity matrices, by matrices of with exactly one column 
set to 1, and by matrices. 

Example 3 

Let us consider the problem: 

51 U U X Ti U Ta U X 

The sets Vi, 1^2, V3, Ci, C2, C3 are computed as in formula (O of Section ^21 ^1 = 
{5i,52}, V2 = {Ti, T2}, V3 = {X}, and Ci = C2 = C3 = 0. Since the given 
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Fig. 3. The y4C/-matrix for the problem S1US2UX ^ Ti U T2 U X 

problem does not involve constants, the matrix is unique (see Figure|3I). Ri, ■ ■ ■ i R9 
are new variables that allow to compactly represent the unique mgu: 



" Si 


1 


R1UR2UR5, 


S2 


1 


i?3 U i?4 U i?6. 


Ti 


1 


R1UR3U Rj, 


T2 


1 


R2 U R4 U i?8, 


_ X 


1 


R5 U RqU RrD RsU Rg _ 



The two I's in a row state that the two variables should have a part in common in 
each solution. For instance, in the first row it is stated that Ri is a part of Si and 
of Ti (in other words, Ri = SiDTi). 

When the problem involves constants, the matrices have also rows for Ci, C2, C3. 
In this case several matrices are non-deterministically generated. Each of them 
describes a solution; their union covers the whole solution space. 

Example 4 

Let us consider the problem: 

Xi U X2 U Z3 = a U 6 
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where Vi = {Xi, X2, Xg}, G = {a, 6}, V2 ^ = Ci 
^C/- matrices for this problem. Two of them are: 



= C.= 



There are 49 



Vi 










X2 


^3 





1 


1 


1 





1 



C2 



Vi 


Xi 


X2 


^3 








1 





1 






C2 



yielding the unifiers: ^2/0, Xg/a U 6], [Xi/0, X2/6, Xs/a]- 

The number of ^C/-matrices to be computed for a given ACI\ unification prob- 
lem is (2l^2l - 1 + |F3|)l^il(2l^il - 1 + |y3|)ic^l(2l^il + 2l^^l - which is 
0(2(ll''ll+ll*ll)') HBaader and Biittner 1988ll . 

The detection of a solution of a unification problem (i.e., solving the SUS prob- 
lem) clearly implies solving the related decision problem. Thus, the complexity of 
finding a solution can be no better than the complexity of solving the corresponding 
decision problem. In this case, both the problems can be solved in linear time (with 
the assumption in Remark^!. This can be achieved as follows. First verify that the 
decision problem s = t has a positive answer; this can be done in linear time thanks 
to the results in Lemma^and Section 11?^ If the test succeeds, then a solution can 
be constructed by assigning to each variable X in s = t a, term composed of the 
union of all the constants present in s = t. For further details the reader is referred 
to iBaader and Biittner 1988,1 . 



6.4 Discussion 

A simpler unification problem — called the elementary ACIl unification problem — 
has been considered in the literature. This problem involves terms which are con- 
structed using only variables, the constant 0, and the binary constructor U (i.e., a 
subcase of gflat(g) with n = and (? > 0). This problem is simpler in the sense that 
the decision problem has always a positive answer — i.e., each unification problem 
s = t has a solution. Therefore, the complexity of finding an arbitrary solution 
is 0(1). Furthermore, each elementary ACIl unification problem admits a single 
most general unifier. In |Appendix B| we show a variant of the ^C7-matrices for this 
simplified problem. 

As a final remark, ({Hermann and Kolaitis 1997l|Kapur and Narendran 1992| ) show 
how the result presented in this section can be extended to provide a polynomial 
time solution to systems of ACIl with constants unification problems. 



7 General {Ab){Ce) Unification 

Set terms involving variable elements and/or nested sets are not expressible in the 
language of ACIl with constants (see Section IH^ . The proposal we describe in this 
section is intended to enlarge the domain of discourse to the more general class of 
nested (g) set terms with g < 1. As already observed at the end of Section lOl in 
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this case we can rely on the element insertion operator {• | •} as the set constructor 
for the concrete implementation of sets. This choice allows the presence of at most 
one set variable in each set term, while ACIl with constants does not place any 
restriction on the number of set variables which can occur in each set term. On 
the other hand, it allows us to represent nested sets — which is not possible using 
ACIl with constants unification. Moreover, it allows sets to be viewed and ma- 
nipulated in a fashion similar to lists. As a matter of fact, this approach has been 
adopted in a number of logic and functional-logic programming languages (e.g., 
CLP {S ST) UDovier and I^ ossiloMI IDovier et al~nOO,l . SEL ( .Jayaraman 1992} , 
SETA ( |Arenas-Sanchez and Rodriguez- Artalejo 2001| )). 

The unification algorithm we propose here is similar to the one presented in l|Dovier et al. 1996jl — 
but with a considerably simpler termination proof. The underlying equational the- 
ory contains the two identities {Ab) and (C£) shown in Sectional stating the fun- 
damental properties of the set constructor {• | •}. 

7.1 Language and Semantics 

S is a signature containing the binary function symbol {• | •}, the empty set constant 
symbol 0, and an arbitrary number (possibly infinite) of free function symbols with 
arbitrary arities. 

Definition 9 

An {Ah){Ct) set term is either a variable, or the constant 0, or a S-term of the 
form {t I s}, where i is a E-term and s is an {Ab){Ci) set term. An individual term 
is either a variable or a S-term of the form /(ii, . . . , t„) with / ^ {• | •}, / ^ and 
ti, . . . ,tn are S-terms (if n = it is a constant term). 

The function symbol {• | •} has the properties described by the identities (Ab) and 
{C£) introduced in Section^ Hence, set terms denote hereditarily finite sets based 
on U, while individual terms denote arbitrary elements of the universe U. As a 
notational convenience { si | { 52 | • ■ • { | i } • ■ ■}} will be written as {si, . . . , s„ | 
or simply as {si, . . . , s„} when t is 0. 

7.2 Which Kind of Set Unification 

The {Ab){C£) language allows us to describe the SUD and SUS problems for 
nested(l) set terms — i.e., arbitrary nested sets with at most one set variable per set 
term. In particular, the language allows us to deal with all classes of set terms that 
are included in nested(l), namely ground, gflat(l), and flat(l). 

Example 5 

The following are set terms and set unification problems which are allowed in 
iAb)iCi): 

. {x,{r}} = {z,0} 

• {{Xi, a} I Yi} ~ {X3 I Y2} (i.e., in abstract syntax — cf. Sectional— {{Xi, a}}U 
Yi = {X3} U Y2) 



Set Unification 



25 



• the Graph coloring problem of Section can be encoded as an {Ah){Ct) 
problem: 

{{Zi, X2}, {X2, X3}, {X3, X4}, {X4, Xi} I R} - {{ci, C2}, {ci, C3}, {C2, ca}} 

On the other hand, the problem ^Ui?UC = {a}UD cannot be expressed in this 
framework. 

7.3 Unification Algorithm 

The algorithm consists of three parts. The first part (AbCl_unif y — see Figure 0)1 
chooses one equation at a time using a semi-deterministic strategy. The second 
part (AbCl_unif y_actions — see Figure IS)) performs the rewriting of the selected 
equation. A final processing of membership equations, i.e., equations of the form 
X — {to, . . . ,tn \ X} with X vars{to, . . . , i„), (AbCl_unif y_f inal — see FigureEJ 
constitutes the third and final part of the algorithm. 

The system £ is split into three parts: £s is the solved form part (initially set to 
empty), £ns is a system of equations (initially set to the input system £in), and £aux 
is a system of equations dealt with as a stack. For £^aux we assume the existence 
of a push operation that puts an equation on the top of the stack and of a pop 
operation that returns and removes the equation on the top of the stack. Given a 
system of equations £in, the algorithm non-dcterministically returns either fail or 
a collection of systems in solved form. 

AbCl_unify(£'in) : 

• J ^^ns • ^in j ^aux • > 
£ . — (£^s } ^ns t ^aux) J 

while Ens 7^ or £anx 7^ do 

if faux / then e :— pop (faux) 

else select arbitrarily an equation e from fns and remove it ; 
AbCl.unify.actionsCf, e) ; 
AbCl_unify_final(f ) 



Fig. 4. General {Ab){C£) Unification Algorithm (main) 



In the algorithm we make use of the function tail, defined as follows: 

t if t is a variable or a term f{ti, . . . , i„), / ^ {• | •} 

tail(t2) if t is {ti 1 12} 



tail(t) 



For instance, if s = {a, b}, namely s = {a | {& | 0}}, then tail(s) — 0. 

The core of the unification algorithm (FigureEJ is very similar in structure to the 
traditional unification algorithms for standard Herbrand terms (e.g., (|Martelli and Montanari 1982|l '). 
In particular, rule (1) is also known as the Trivial rule, rule (2) as the Orient rule, 
rules (3) and (4) are the Occurs Check rules, rule (5) is known as the Variable Elim- 
ination rule, rule (6) as the Symbol Clash rule, and rule (7) as the Term Decompo- 
sition rule (Jouannaud and Kirclmer 1991|l . The main difference is represented by 
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AbCl_unify_actioiis(f , e) : 
case e of 



(1) 




X = X 


' ^ ^ns • — ^ns 


(2) 




t = x \ 

t is not a variable J 


1-^ £ns := fns /\{X = t) 


(3) 




X=f(h,...,U) \ 
■} and X occurs in f{ti, . . . ,tn) J 


H- > fail 


(4) 




X = {h,...;tn\t} ] 

• •} and X occurs in t {X ^ t), > 
or X occurs in to,. ■ ■ ,t„ J 


H-> fail 


(5) 




X = t \ 
X does not occur in t } 


^ £,[X/t\h{X = t); 


(6) 




f{si,...,Sm) = g{ti,. . . ,tn) \ 

f^g 5 


1— »■ fail 


(7) 




/(S1,...,S„) = f{tl,...,tn) \ 


1— » fns := fns A (Sl = £l A . . . A Sn = tn) 


(8) 




{t\s} = {t'\s'} 


AbCl_step(£',{£|s} = {£'!«'}) 



AbCl_step(£', I s} = {i' | s'}) : 

if tail(«) Eind tail(s') are not the same variable then choose one among: 

(i) fns := fno /\{t = t');pUSh(s = S' , faux) 

{ii) Ens ■■= fns ^{t = <');pUSh({t I s} = s' , faux) 

(iii) Ens ■■= fns A (£ = i');pUSh(s = {t' \ s'} , faux) 

(iv) pUSh(s = {t' I N} , faux); pUSh({< \N} = S' , faux) 

N new variable 

else let {t\ s} = {to, ■ ■ ■ ,tm\ X} and {£' | s'} s {t^, . . . , | X}, X variable; 
select arbitrarily i in {0, ...,n}; choose one among: 

(*) fns := fns A (to = ij); P^sll({*l> • ■ • , I X} = {to, . . . , £,'_!, ...,41 }i £^aux) 

(m) fns := fns A (to = t ■ ) ; PUSh({ , . . . , U I X} = {to, . . . , + . . . , <^ I X}, faux) 

(iii) fns := fns A (to = f,'); PUSh({tl, . . . , tm ] X} = {to, . . . , t'n I X}, faux) 

irv) pUShfX = {f„ I X}- faux): push f{fl f,„ | ,Y } = {f,', | X}. faux) 



AbCl_unify_f inal(f ) : 

Repeatedly perform any of the following actions; 
if neither applies then stop with success; 



X = {iO, . . . , CI X} A . . . A X = {£o^ . . . , 4 I X} A f 
(1) fc > 0, the number of all membership equations involving X 
X does not occur in <o , . . . , , . . . , io , • • . , 

X = {*0i ■ ■ ■ ,t%^, ■ ■ ■ ,'to , ■ ■ ■ ,'tnk I ^} A £[X /{t^, . . . ,t1^, . . . ,t^ , . . . ,tn^. I N}] 

fo\ X = tA£ \ . 

A occurs m r 



Fig. 5. General {Ab)(C£) Unification Rewriting Rules 
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the presence of rule (8), whose aim is the reduction of equations between two set 
terms. Reduction of this kind of equations is performed by the procedure AbCl_step 
(see Figure ISjl that implements the two identities {Ah) and {Ctj. {Ah) and {Ctj 
are equivalent, for terms denoting sets, to the following axiom IjDovier et al. 1998)l : 

/ {yi\Vi} = {Y2\V2} ^ \ 
{Yi =Y2hVi = V2)y 
{Yi =Y2AVi = {Y2 \ V2})y 

{Yi = r2A{ri| Vi}= V2)y 

3K {Vi = {Y2\K] A V2 = {Yi\K]) ) 

which can be easily converted into a rewriting rule to be used in the unification 
algorithm. {E^) is in a sense a "syntactic version" of the extensionality axiom, 
which allows the extensionality property to be expressed in terms of only equations, 
without having to resort to any membership, universal quantifiers, or inclusion 
operation. {E^) allows also to account for equations of the form 

{io, ■ ■ ■ ,tm \ X} = {to, • • ■ , I X} , 

where the two sides are set terms with the same variable as tail element. Unfor- 
tunately, a blind application of the rewriting rule obtained from {E^) would lead 
to non-termination in this situation. This is the reason why this case has been iso- 
lated and dealt with as special in the algorithm (within the procedure AbCl_step), 
actually splitting the rewriting rule obtained from {E^) into two distinct rules. 

A call to AbCl_step introduces equations in the stack faux that are immediately 
processed. This generates a deterministic sequence of actions. We refer to the se- 
quence of actions performed until the stack becomes empty as the global eSect of 
AbCl_step. 

Membership equations, i.e., equations of the form X ~ {to, . . . , | with 
X ^ vars{to, . . . , tn), are not dealt with by any rule of AbCl_unif y_actions. This 
kind of equations turns out to be satisfiable for any X containing to, . . . ,t„ since 
duplicates are immaterial in a set thanks to {Ah) and {C£) (this justifies the name 
membership equations). These equations are processed at the end of AbCl_unif y by 
the procedure AbCl_unif y Jinal. Also, observe that the occur-check test performed 
by the standard unification algorithm is modified accordingly, so as to distinguish 
this special case from others (rules (3) and (4)). 

Correctness and completeness of the algorithm presented in this paper derive im- 
mediately from the similar algorithm of IjDovier et al. 1996|l . The termination proof 
for this algorithm, however, turns out to be simpler than that in l|Dovier et al. 1996)> . 
since here we rely on a more deterministic strategy, and we provide a separate treat- 
ment of membership equations. Basically, in the algorithm of this paper we avoid 
the repeated application of the rewriting rule: 

X ^{to,...,t^\X}^ X = {to, ...,t^\N} 

that increases the number of variables in the algorithms in l|Dovier et al. 19961 
IDovier et al. 2000}! . This change allows the number of variables in the system to be 
kept under control. The simpler termination proof can be found in [Appendix A| 
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Example 6 

Let us consider the unification problem 

{Xi,X2,X3} = {a,&,c} 

(i.e., {Xi I {X2 I {X3 I 0}}} ^ {a\{b\{c\ 0}}}). The algorithm AbCl.unif y returns 
the following six independent solutions that constitute the minimal complete set of 
i?-unifiers for the given unification problem: 

Xi = a, 6, X3 = c Xi = c, X2 = a, X3 = & 

Xi = a,X2 = c,Xi = b Xi = b, X2 = c,X3 = a 

Xi = b, X2 = a, X3 = c Xi = c, X2 = b, X3 = a 

In general, the algorithm AbCl_unif y may open a large — though finite — number of 
alternatives, possibly leading to redundant solutions. IjArenas-Sanchez and Dovier 19971 
[Stolzenburg 1999| ) show how to improve the algorithm to minimize the number of 
redundant unifiers. 



7.4 Discussion 

The problem of finding solutions we tackle here extends the satisfiability problem for 
set unification (i.e., the SUD problem), shown to be NP-complete (c.f. Sect l4.4ll . To 
be precise, we mean that there exists an algorithm on a non-deterministic machine 
that can also find the answer (the correct class is FNP). IjOmodeo and Policriti 1995|l 
proposes a methodology to guess a solution of a conjunction of literals built using 
variables, the constant symbol 0, the function symbol {■ | •} and the predicate sym- 
bols =,e,U,n, and \. The unification problem is the particular case where only 
positive literals based on the equality predicate = are used. A guess is represented 
by a graph containing a number of nodes polynomially bounded by the number 
of variables in the original problem. Verification of whether a guess is a solution 
of the problem can be done in polynomial time. IjOmodeo and Policriti 1995|l also 
shows how this technique can be extended to the general problem with free function 
symbols — the one we deal with in this paper. A non-deterministic algorithm based 
on a "guess-and- verify" technique has also been proposed in ( |Kapur and Narendran 1986| ). 

The algorithm presented here, as well as those in l|Dovier et al. 1996llArenas-Sanchez and Dovier 1997|l 
have the common drawback that, due to the explicit application of substitutions 
during the solving process they have a computational complexity which falls out- 
side of the FNP class. Nevertheless, it is possible to encode this algorithm us- 
ing well-known techniques — such as multi-equations or graphs with structure shar- 
ing iMartell i and Montanari 19821 [Paterson and Wegman 1978| ) — that allow us to 
maintain a polynomial time complexity along each non-deterministic branch of the 
computation. For instance, in l|Aliffi et al. 1999jl a goal driven algorithm in FNP 
for non-well-founded and hybrid sets has been presented. In that paper it is also 
shown how to use the algorithm for well-founded sets, to solve the problem dealt 
with in this section. A similar result is presented in IjDantsin and Voronkov 199911 . 
A detailed discussion of such kinds of enhancements, however, is outside the scope 
of this paper. 
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As far as the size of the computed complete set of unifiers is concerned, we can 
observe that the algorithm opens, for each level of nesting, a number of alternatives 
equivalent to the number of solutions returned by the global effect of AbCl_step. 
This number is no greater than the size of the minimal complete set of {Ah){Ci)- 
unifiers for the problem: 

{Xi,...,X^|M}-{X^+i,...,X„|iV} 

This value has a rough upper bound equal to 0(2" Is ") ^ renas-Sanchez and Dovier 1997|l . 
Since this process can be repeated once for each nesting, a rough upper bound to 
the number of solutions is 0(2" 's")^ 

Various authors have considered simplified versions of the {Ah){C^) problem 
obtained by imposing restrictions on the form of the set terms. Most notable is 
the use of sets in the context of relational and deductive databases l|Liu 19981 
lAbiteboul and Grumbach. 1991l|Naqvi and Tsur 1989||Lm and Ng 1997| ). Typical 
restrictions which have been considered are flat and completely specified set terms, 
i.e., elements either of the gflat(g) or flat(O) classes. Specialized algorithms have been 
provided for some of these cases. In particular, various works have been proposed 
to study the simpler case of matching and unification of Bound Simple set terms 
Hdreco 199611 . i.e., elements of flat(O). These restrictions are sufficient to make the 
task of deciding unifiability between set terms very simple — as also discussed in 
Section m 

Let us illustrate the results in the simple case of matching l|Arni et al. 1996|l 
(the approach has been generalized to sequential unification in IjGreco 1996|l and 
to parallel unification in ( |Lim and Ng 1997| )). In the case of matching, the two set 
terms s and t to be unified can be written as: 

S = {Ci, . . . , Cr, Xi, . . . ,Xh} t = {bi, . . . ,bk, Ci, . . . , Cr} 

where, according to our notation (see Section H?^ . hi E C2, Ci ^ C3, and Xi £ V\ 
(Oi = otherwise the problem has no solutions). The two terms unify iff /i > 
(see Section . From l|Arni et al. 1996jl we know that the number of solutions is 

E(-ir(-)(fc + 

The set of substitutions representing the correct solutions of the matching problem 
s = t can be obtained by: 

• computing all the /i-multisets of {61, . . . , ci, . . . , c-r} that contain all the 
elements of the set { &i , . . . , 6^ } 

• computing all the distinct permutations of each multiset. 

An algorithm based on this approach is optimal, in the sense that it computes 
exactly a complete and minimal set of unifiers, with a complexity that is linear in 
the size of such set of unifiers. 
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8 General ACIl Unification 

The unification problem considered in Section is capable of dealing with flat 
set terms containing an arbitrary number of set variables. On the other hand, the 
unification problem of Section|Z|allows unification between possibly nested set terms 
with at most one set variable per set term. The goal of this section is to provide a 
solution to unification problems which do not fall in any of the two above categories, 
namely, unification problems in presence of set terms which can be nested at any 
depth and which may contain an arbitrary number of set variables. We will refer 
to this kind of problems as general ACIl unification problems. 

We propose a novel solution that combines the algorithms of Sections El and 
developed for solving ACIl unification with constants and general {Ab){C£) unifi- 
cation. The result is a new goal-driven algorithm for general ACIl unification. 

8.1 Language and Semantics 

We consider a language whose signature E contains the constant 0, the binary 
function symbol U, and a (possibly infinite) collection of free function symbols with 
arbitrary arities. 

Definition 10 

An ACIl set term is either a variable, or the constant 0, or a S-term of the form 
iUs, where t and s are E-terms. An individual term is either a variable or a S-term 
of the form /(ii, . . . , t„) with / ^ U and / ^ and ti, . . . , i„ are S-terms (if n = 
it is a constant term). 

The function symbols U and have the properties described by the identities (^), 
(C), (/) and (1) introduced in Sectional Hence, set terms denote hereditarily finite 
sets based on lA, while individual terms denote arbitrary elements of the universe 
U. 

In the rest of the discussion we will assume the existence of at least one function 
symbol / G S of arity greater than zero — note that if such symbol does not exist, 
then we are in the case discussed in Section Intuitively, terms based on such 
symbol will be used to encode singleton sets. Without loss of generality we assume 
to use the unary function symbol {•} to represent singleton sets (more generally, if 
the chosen symbol / is of arity n, n > 0, we could assume that the term/(s, 0, . . . , 0) 
is used to denote the singleton set containing the element s). In this way, it will be 
possible, for instance, to distinguish the individual element a from the set containing 
a (i.e., {a}). Moreover, as a notational convenience, we will denote the term {si} U 
• • • U {s„} with {si, . . . , s„}. 

8.2 Which Kind of Set Unification 

The general ACIl language allows us to describe the SUD and SUB problems for 
any abstract set terms in set(m, n, p, q). In particular, the cases flat((7) and nested((7) 
with q > 2 are handled in this framework (and not in any of the previous ones). 
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Example 7 

The following are set terms and set unification problems which are allowed in general 
ACIl: 

• {{A, B}UCUD}UEUF = {{X, IjjUEUG 

• {{g{a)}UX}U Z = {b}U TU S 

8.3 Unification Algorithm 

In this section, we propose a novel algorithm to directly solve the general ACIl 
unification problem. The algorithm is composed of a main procedure (general_aci) 
and a rewriting function (aci_step), which deals with equations between set terms 
(see Figure IH)). 

The structure of the main procedure is very similar to the structure of standard 
unification algorithms for the Herbrand case. The algorithm maintains two separate 
collections of equations, £s and £ns- the first collects the equations in solved form 
while the second contains the equations that require further processing. As in the 
case of {Ah){Ct) unification, the main changes with respect to standard Herbrand 
unification are concerned with the two rules dealing with set terms (i.e., terms 
containing occurrences of U at the outermost level): 

• rule (5) which is aimed at dealing with equations of the form X = ■ ■ -UX which 
are satisfiable in the case of ACIl theory, whereas they were not satisfiable if 
the U symbol would be uninterpreted; 

• rule (8) which is used to solve equations between two set terms. 

We will use the notation s to denote the list of terms si , . . . , s„ , and s — t to denote 

S\ — , . . . , Sn — tn • 

aci_step receives as input the equation between set terms to be solved and 
non-deterministically produces as result two systems of equations (corresponding 
to the £s and fns of the main unification procedure) and a substitution. aci_step 
performs its task in four successive steps, as shown in Figure El Term Propagation 
is the only (don't know) non-deterministic step of the whole algorithm. Both Term 
Propagation and Variables Removal can lead to a failure for some of the non- 
deterministic choices performed within Term Propagation. Let us analyze these 
steps in more detail. 

Normalization: 

input: A system consisting of the single equation 

/i(Ii)U---UA,(4JULiU---ULfc, = gi{fi)U- ■ ■UghArh^)^Ri^- ■ -^RhM 

where Li, Rj (0 < i < fe, < j < fe) are variables and fi, gj {0 < i < ki, 
< J < /ii) are function symbols different from U. 
output: A system 

= iV/' = /i(Ii)A---AiV,^ =/,,(7,J A 

^ giih) A ■ ■ ■ A N,^^ ^ gh,{fh,) A 

iV/- U • ■ • U iV^^ U Li U ■ • ■ U Lfc2 = A^i" U • ■ • U A^^ U i?i U • ■ • U 



32 



Agostino Dovier, Enrico Pontelli, and Gianfranco Rossi 



where N/" and N/^ are new distinct variables. 

^^^^^^^^^^^^ 

This step, following the idea used in IjLincoln and Christian 1989IIBaader and Schulz 1996|l . 
performs a normalization of the problem £ into the problem — producing an 
equation between set terms that contains only variables. 
Elementary ACIl Solution: 

input: The system produced by the Normalization step; 

output: A pair of systems S^*^'^ and £2'^^ obtained by solving the elementary 

ACIl unification problem 

U ■ ■ -U Ni[U LiU ■ ■ -U Lk^ ^ Nf^U ■ ■ -U Ni^^U RiU ■ ■ -U (5) 

of 5". This problem can be directly solved by using the techniques seen in Sec- 
tion El (see also Example . The result of the computation is a collection of 
equations of the form V — Aaj^^hi ^ Aa2.b2 U • ■ • where F is a variable occurring 
in the two terms to be unified and Aa-^^b-^, Aa2,b2i ■ ■ ■ n^w variables generated 
by the unification algorithm. The solved form equations associated to Lj and Ri 
form the set £^'^'^ . £^'^'^ is composed of the equations concerning the variables 
Nj^ and Nf-. These variables are immediately replaced by the terms they have 
been set equal to during the Normalization step. 
Term Propagation: 

input: The pair of systems £f''^'^ and £2'^^ produced in the previous step; 
output: A pair of systems £1 and £2 ■ 

The equations in £2'^' can be simplified using the semantic properties of and 
U. As a matter of fact, the equations in £2'^^ can be immediately satisfied by 
binding each Aij appearing in the right-hand side of an equation either to or 
to a term which unifies with the left-hand side of the equation. Observe, however, 
that each Aij can occur in the right-hand side of more than one equation; thus, 
it should receive a consistent binding in order to satisfy £2'^^- 
More precisely, a substitution A describing the solution of the equations in £2'^'^ 
can be build as follows. Let us assume that an ordering has been fixed on the 
equations in £2'^'^ and on the variables Aij. Thus, for each Ai,j occurring in 
£2^^^ we can identify an equation j which contains the "first" occurrence of 
such variable in its right-hand side. If /(s) is the left-hand side of such equation, 
then X{Aij) is non-deterministically defined to be either 

• A(^,j) = or 
. A(^,j)=/(s). 

As soon as the value of X{Aij) has been determined, the substitution is immedi- 
ately applied to £2'^' ■ Once all the Ai^j occurring in £2^' have been processed, 
the system is reduced to a collection of equations of the form: 

/(s) = /i(si)U---UA(s^) 

with h > \ (without loss of generality, we may assume that all the occurrences of 
in the union have been removed, as well as repetitions of the same term) . The 
above result also relies on the assumption that at least one Ai,j per equation is 
assigned a term different from 0. 
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If some of the fi is different from / for some equation, then another guess for 
A must be chosen; if no choice leading to the satisfaction of this condition can 
be made, then the system does not admit solutions. Otherwise, let the output 
system £2 consist of all equations of the form: 

s = si A . . . A s = s/i 

for each equation in £2'^^- 

The other output system, £1, is obtained by applying A to the input system £^'^^ , 
with the usual elimination of and repetitions in the unions. Thus, 

Variables Removal: 

input: The pair £1 and £2 computed in the previous step; 
output: The substitution p. 

From £1 we can directly produce a substitution which allows all variables Lj 
and Ri to be removed. More precisely, this is obtained as follows. Let p^^ and 
pn^ denote the substitutions that respectively replace Lj (1 < j < ki) and Ri 
(1 < i < /ii). In order to guarantee that £1 U£2 admits solutions we need to make 
sure that no cyclic conditions occur. 
Let us define the relation as follows: 

X => Y iff Ye vars{Xpx) 
and let us denote with =>* the transitive closure of 

A necessary condition for the solvability of the set of equations £1 is that 

(vx e {L^,...,Lk,,Ri,...,RhMx xy 

If this test is satisfied, then we can construct a global substitution 

p = PLi o • • • o pL,^ o pn^o---o pn^^ 

which allows all variables {Li, . . . , L^^ , . . . , Rhi} to be removed. 

A detailed description of the algorithms for the Elementary ACIl Solution step 
and the Term Propagation step is reported in [Appendix B| 
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general_aci(f ) : 

£s '■= ^; Sr,s := £ (i.e., the initial system of equations); 
while £ns / do 

select arbitrarily an equation e from £ns and remove it; 

case e of 



(1) 



X = X 



— ns • '-'ns 



(2) 



t = X 

t is not a variable 



^ns ■ — ^ns A (^X — i) 



(3) 



X = t 

t can be re-ordered as 
/i(si) U • • • U /„(s„) U Fi U • • • U 
'I ^ 0, /i ^ U, m > 0, and X G vars(si, . . . , Sn) 



fail 



(4) 



X = t 1 ^ 
X does not occur in i J 

:= 5.[Js:/£] A (X = 0;fn= := £^^[X/i\ 



X = t 

t can be re-ordered as t' VJ X \J ■ ■ ■ VJ X , 
(5) t' =/i(si)U---U/„(s„)U FiU---U K„, 

/i ^ U, m > 0, 
X ^ vars{t') 



£ns := A{X = t'LlN) 
N new variable 



(6) 



/(Sl,. . . ,Sm) = s(£i, 



fail 



(7) 



/(Sl, . . . , Sn) =f{tl, ...,t„) 



fns A (Sl = tl A . . . A Sn = in) 



(8) Sl U S2 = tl U t2 1-^ 

Let {£[,£2,9) be a result of 
aci_step(si U S2 = ti U h); 

£b '•— £3^ A £1'-, £ua '— £na^ A £2 



aci_step(e) : 

~ Normalization(e) ; 
{£i^',£2'^') := Elementary.AC/l-SolutionCf") ; 
{£i,£2) Term_Propagation(5i"^^, £2**^^) > 
p := Variables_Removal(fi, ^2) ; 
return {£i,£2p,p) 



Fig. 6. General ACIl Unification Procedure and the function aci_step 
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Example 8 

Let us consider the unification problem: 

{{a}}U{6}UX {{VK}}U Y\JZ 

The Normalization step leads to the system 

= = {{a}} ANi" ^{b}A N^' = {{ W}} A \J \J X = N^' \J Y \J Z 

The equation Nl' U N2 D X = U Y U Z can be solved and its solution applied 
to the rest of the system {Elementary ACIl Solution step), leading to: 

£^^' = X-^1^3U^2,3U^3.3 A £^^' = {{a}} - ^14 U ^2,1 U ^34 A 
Y = A2,l U A2,2 U ^2,3 A {&} = ^1,2 U A2,2 U ^3,2 A 

Z = ^3,1 U yl3.2 U yl3.3 {{ W}} = ^1,1 U ^1,2 U ^1,3 

A possible substitution A produced by the Term Propagation step is the following: 



Ai.i 


^2,1 ^3.1 


j4i,2 j42,2 ^3.2 ^1,3 


{{«}} 


|{{«}} 1 


1 |{&} 1 1 



This produces the systems 

£2 = {W}^{a} £i=X^ ^2,3 U ^3,3 A F = {{a}} U {b) U ^2,3 A Z = ^3,3 

and the substitution p = [^/^2,3 U ^3,3, y/{{a}} U {6} U ^2,3, •^/^3,3]- From £2 
it is then computed [W^/a]. 

8.4 Results for the General ACIl Unification Algorithm 
8.4-1 Soundness and Completeness 
The soundness and completeness results can be derived as follows. 
Lemma 2 

Given an equation e of the form 

U ■ • ■ U A, (IfcJ U Li U • ■ • U Lfe, = giih) U • ■ • U (f,,J U iZi U • • ■ U i?^, 

let {£{,£2, p'^), for i — 1, . . . ,k, be the collection of all the distinct solutions non- 
deterministically produced by the call aci_step(e). Then: 

• if (T is a unifier of £[ U £2 then cr is a unifier of e and u < 

• if (7 is a unifier of e then there exists 1 < i < k and a substitution 7 such 
that cr U 7 is a unifier of £{ U f 2 ■ 

For the proof, see [Appendix A| 

Theorem 1 

The unification procedure general_aci is correct and complete with respect to the 
general ACIl theory. 
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Proof 

Immediate from the above Lemma |21 concerning the auxiUary function aci_step, 
and from the classical results regarding Herbrand unification for the remaining 
rules. □ 

8.4-2 Termination of genera.l_3iC± 

The development of a termination proof for general unification algorithms for theo- 
ries obtained using some or all of the {A), (C), (/), and (1) axioms is a well-known 
challenging task IjBaader and Schulz 1996|l . In HI* 'ages 19^ Fages proposed a ter- 
mination proof for general AC unification. The complexity measure developed by 
Fages to prove termination, however, is not applicable to our problem — mainly due 
to the need, in our algorithm, to introduce new variables to handle cases such as 
X — Y \J X, that are unsatisfiable in but admit solutions in ACIl. 

The detailed termination proof (TheoremEl) is reported in [Appendix A| We give 
here the main ideas behind that proof. First of all, aci_step replaces an equation 
between two sets with equations between members of the sets, thus with equations 
of a "lower level" . The process cannot enter in a loop thanks to the occur-check test 
which avoids the possibility of generating infinitely-nested sets. To formalize this 
idea we define the notion of p-level (Dei Ml^ . Terms can be naturally viewed as trees. 
We use two kinds of edges in these trees, edges connecting U-nodes to their children 
and edges linking all other types of nodes. We show how the unification algorithm 
operates on this tree representation of terms, and we determine some properties 
related to cycles involving edges of the second type (-|-l-edges). Finally, we define 
a complexity measure built from the notion of p-level of the terms occurring in 
the system of equations. We show that this measure is well-ordered and that any 
given sequence of applications of rules either decreases it, or an occur-check failure 
is detected. 

8.5 Discussion 

A non-deterministic algorithm for general ACI is presented by Kapur and Naren- 
dran in | |Kapur and Narendran 1992 1) that can be adapted to general ACIl. An- 
other algorithm for general ACIl unification can be obtained as an instance of the 
general technique of Baader and Schulz l|Baader and Schulz 1996|l for combining 
unification algorithms. Combining unification procedures for different unification 
problems has been a major topic of investigation for years l|Siekmann 1984|l . Vari- 
ous proposals have been put forward to allow combination of unification procedures 
under different conditions on the equational theories IjYellick 19851 IHerold 19861 
ITiden 1986IIKirchner 198 9I ISchmidt-Schaufi 1989 >). In fBaade r and Schulz 1996j l Baader 
and Schulz proposed a general technique for combining unification procedures over 
disjoint theories under very simple restrictions — i.e., constants restriction. In the 
context of general ACIl unification, we need to combine two theories: the the- 
ory ACIl for and U, and the empty equational theory for all the other function 
symbols. The technique proposed by Baader and Schulz can thus be used to inte- 
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grate the unification procedure for ACIl with constants and a standard Herbrand 
unification algorithm to obtain a unification procedure for general ACIl. 

Let us briefly compare these two proposals with the unification algorithm for 
general ACIl presented in this paper. All three unification procedures start with 
a Normalization step (implicit in ( |Kapur and Narendran 1992| )). New variables are 
introduced for subterms. As an example, the problem 

{X}U{r} = {a}U{&} (6) 

is rewritten as 

U = U N^, = {X}, = { Y}, = {a}, = {h} (7) 

All three procedures introduce don 't know non-determinism. In particular, l|Baader and Schiilz 1996|l 
introduces non-determinism in steps 3 and 4, where 

• step 3 computes an arbitrary partition of the variables in independent sets 
(all the variables in the same component of the partition will be aliased to 
each other in the final solution); 

• step 4 imposes an arbitrary order over the elements of the previously com- 
puted partition. 

In the formulad for instance, there are 6 variables. Therefore, there are {i} = 

203 possible partitions of the set of variables^, and 6! = 720 possible strict orderings 
among the 6 variables. Actually, the problem has only two independent solutions 
X ^ a, Y = b and X = b, Y = a that suggests the need of only 2 non-deterministic 
choices. The high number of choices in IjBaader and Schulz 1996|l derives from the 
generality of the combination procedure (which is not specifically tied to the prob- 
lem of set unification). On the other hand, it is unclear whether the instantiation 
of that framework to the problem at hand would actually reduce the number of 
alternatives compared to the algorithm we propose in this paper. 

The unification procedure presented in ( |Kapur and Narendran 1992| ) is rather 
different. It performs a series of non-deterministic guesses for the variables in order 
to find ground substitutions. It has two main practical drawbacks. The first is that 
the number of choices does not depend on the structure of the problem but rather 
on the signature. The second drawback is that the algorithm always returns ground 
substitutions. The number of ground substitutions of a general ACI problem can 
be infinite. Let us consider, for instance, the problem 

{0} u y = y (8) 

Y = {0}, r = {0} U {{0}}, y = {0} U {{0}} U {{{0}}}, ... are all the ground 
solutions for However, a unique most general unifiers, Y — {$}lJN is sufficient 
to finitely describe all solutions (this is exactly what our algorithm returns). Even 
for problems where only ground unifiers are present, our algorithm has the advan- 
tage of using the symbols in the problem to drive the construction of the solution, 
instead of performing a blind enumeration based on the language signature. 

^ {"} is the number of partitions of n elements into i classes, known as Stirling number of the 
second type tGraham ct al. 1994J . 
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As concerns the non-determinism introduced by our algorithm, first observe that 
the Normalization step allows us to call the elementary ACIl unification step with 
terms containing only variables. In this case it is known that the unification problem 
admits a unique mgu. So, we are not exploiting the possibility of the ACIl with 
constants unification algorithm to return non-deterministically all the mgus and we 
perform that choice later. The rationale behind this is that the non-variable terms 
si, . . . , S£, ti, . . . ,tr in an equation XiU- ■ -UXmUsiU- • -Use — YiU- ■ -U F„UfiU- • -UU 
can be compound terms. We do not know (yet) if some of them can be unified, 
and thus we cannot consider them as equal or different constants when calling the 
ACIl with constants algorithm. Possible optimizations of our algorithm include 
the use of ACIl with constants in those cases where a simple preprocessing allows 
us to quickly determine what individuals in the equations are equal or distinct. If 
Vit V2, V3 are the set of variables in the elementary ACIl unification problem as 
defined in Section ^21 then the Boolean ACI matrix IjBaader and Biittner 1988| 
is of size i\Vi\ + \V2\ + I V3|)(|^i||t^2| + l^illVal + \ V2\\V3\ + IF3I) and the new 
variables introduced are | 1 1 V2 1 + | l^i 1 1 V3 1 | V2 1 1 V3 1 | Va | . Our elementary A CI 
matrix (see [Appendix Bj l introduces the same number of variables, but its size is 
(I ^il + I ^3|)(| V2\\ Vsl). For instance, if | | = | F2I = | V^sl = « we need space 4t;2 
against space 9v^ + 3v'^. 

All non-deterministic choices are performed in the Term Propagation step. If 
k is the number of variables introduced by the matrix, this would potentially 
open 2*^ non-deterministic choices. However, using the auxiliary Boolean matrix 
(see [Appendix B| ) wc do not try all these choices, since for each column and each 
row of the matrix for Term Propagation there must be at least one variable which 
is different from 0. This decreases the number of choices. In the case of the system 
of equations (|7|) wc have only 8 non-deterministic choices instead of the 2'* expected 
(and the 203 x 720 of the naive application of the Baader-Schulz procedure). 

As far as the difference in non-determinism between the general ACIl and the 
general {Ab){C£) unification is concerned, we can observe that the ACIl algorithm 
opens, for each level of nesting, a number of alternatives equivalent to the resolution 
of an ACIl with constants problem; this leads to 0(2" ) solutions (see Section IO| . 
Since this process can be repeated once for each nesting, a rough upper bound to 
the number of solutions is 0(2" ). Observe that this number of solutions is greater 
than those computed by the {Ab){C£), namely 0(2"' '°s"). This fact suggests that 
the general ACIl unification should be used only when the problem is really not 
expressible using the general {Ab){C£) unification and the full range of solutions is 
required. 

9 Related Work 

Most of the related proposals have already been discussed throughout the paper. 
In this section we provide a brief overview of other related contributions. 

Boolean unification. Boolean unification is a very powerful framework that al- 
lows one, in particular, to mimic the ACIl with constants unification problems. 
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The richer language of Boolean unification, however, allows the various solutions 
of a given ACIl problem to be encoded in a very compact way, as a single com- 
plex solution — instead of using multiple ^C/-matrices as in Section |H| A funda- 
mental work in this area is ( [Martin and Nipkow 1989| ), which surveys both the 
Boole's method and the Lowenheim's method. The former has been originally 
described in IjBiittner and Simonis 1987|l while the second has been initially de- 
scribed in ( [Martin and Nipkow 1 988). All these approaches deal with Boolean 
unification with constants, where the signature S contains a possibly infinite col- 
lection of constants, which intuitively represent the elements of the universe U. 
The class of terms allowed in this framework extends the one considered in this 
paper by allowing a variety of different operators to be used in the construction 
of sets, such as intersection n and complementation (•). 

The complexity of the decision problem of Boolean unification has been studied 
in ([Baader 1998|l . In the elementary case, i.e., without constants, the problem is 
NP-complete, while in the case with constants the problem becomes PSPACE- 
complete. However, if the input is of the form admitted by ACIl unification, the 
test between two ground terms can be performed in linear time. The computation 
of the unifier for a given Boolean unification problem s = t is based on the fact 
that is a unifier of s = < if and only if is a unifier of s A t = 0, where 
A is a function symbol which is interpreted as the symmetric difference. Thus, 
to solve a unification problem, it is sufficient to solve a matching problem. The 
work in IjBiittner and Simonis 1987|l shows that a unique most general unifier 
is sufficient to cover all the solutions. The generality of this scheme and the 
power of this unification procedure are balanced by the complexity of the answers 
produced — sets built using A are arguably more complex and less intuitive than 
those constructed using U. 

Computable Set Theory. The work on Computable Set Theory ([Cantone et al. 2001|l 

has been mainly developed at the New York University, with the objective of en- 
hancing the expressive power of inference engines for automated theorem provers, 
and for the implementation of the imperative set-based programming language 
SETL ([Schwartz et al. 1986 1. The general problem is to identify computable 
classes of formulae of suitable sub-theories of Zermelo-Fraenkel set theory. In 
this context, the set unification problem is seen as a special case of the satisfia- 
bility problem for the 3*V-class of formulae. As a matter of fact, thanks to the 
extensionality axiom, testing whether two terms s and t with variables Xi, . . . , X„ 
are unifiable is equivalent to testing whether the following holds: 

HF h 3Xi ■ • • 3Xn VZ(Z es ^ z et)- 

Unification algorithms can be obtained by instantiating the general (and com- 
plex) techniques for testing satisfiability of 3*V-formulas ([Dovier et al. |l . 

Set constraints. Set constraints ([Kozen 19981 I Aiken 1994[l are conjunctions of 
literals of the form ei C 62 where ei and 62 are set expressions, constructed 
using variables, constant and function symbols, and the union, intersection, and 
complement of set expressions. Set expressions denote sets of Herbrand terms. An 
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expression identifies a subset of the Herbrand universe. A unification problem of 
the type s = t can be expressed in this framework as the constraint s Q t At C s. 
The framework is sufficiently powerful to solve ACIl unification problems with 
constants; nevertheless, the peculiar interpretation given to terms in the language 
is such to prevent to encode large classes of set unification problems. In particular, 
to represent nested sets in set constraints we need to make use of a distinguished 
functional symbol {•} (as described also in Sect ion 153)) : on the other hand, using 
the set constraint interpretation of expressions, the two expressions {{s, t}} and 
{{s}, {t}} would be mapped to the same set. 
Alternative representations of sets. Other syntactic representations of sets are 
also feasible. For instance a set of n elements can be represented by {}„(io, ■ • • , ^n), 
where {}„ is a function symbol of arity n. This solution requires the introduction 
of an infinite signature, with a different set constructor for each possible finite set 
cardinality. This approach has been adopted, for example, in IjShmueli et al. 1992|l . 
In order to use this solution it is necessary to introduce a complex infinite equa- 
tional theory, capable of specifying the unifiability of set terms with different 
main functors — as in the case {}3(X, Y, Z) — {}2{a, b). 

This representation scheme allows one to express only set terms with a known 
upper bound on their cardinality. Namely, \ {}n{tiT ■ ■ ■ , Ui) \ ^ n ■ 

10 Conclusions 

In this paper we have presented a survey of the problem of solving unification in the 
context of algebras for sets. We have abstractly defined the set unification problem 
and developed the corresponding equational theories, starting from the simpler case 
of with constants and proceeding to the most comprehensive case of general 

ACIl unification. We have presented decision and unification procedures for the 
different classes of unification problems and analyzed their complexity. Complexity 
results, as well as the suitable equational theory for a given set unification problem, 
are summarized in Table ^ The algorithms presented are either drawn from the 
literature or are brand new algorithms developed by the authors. 

We believe this work fills a gap in the literature on this topic, by providing a 
uniform and complete presentation of this problem, and by presenting a comparative 
study of the different solutions proposed. 
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Appendix A Proofs 

A.l Termination of kbCljaaify 

To prove the following theorem, we will use the notions of solved variable and 
solved equation. Given a system £ an equation in £ is solved if it is of the form 
X = t and X docs not occur neither in t nor elsewhere in £. If X is the r.h.s. 
of a solved equation then it is a solved variable. Moreover, size is the function 
returning the number of occurrences of constant and functional symbols in a term 
{size{X) = 0, size{f{ti, t„)) = 1 + JZ'i^i size{ti)). 

Theorem 2 (AbCl_unif y termination) 

For any Hcrbrand system £, and for any possible sequence of non-deterministic 
choices, AbCl_unif y(f ) terminates. 

Proof 

To start, do not consider the final call to AbCl_unif yjEinal. We associate the 
complexity pair {A, B) to a system £, where: 
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A is the number of non-solved variables in £ 

let p = m.ax{size{£) : £ = r in £}. For i = 0,...,p, let ri{i) be the number of 
non-solved equations £ = r in £ s.t. size{£) = i. Then B is the list: [r]{p),r]{p — 
1), . . . , 7/(0)]. We define the ordering among lists as follows: 

X <iist y iff {length{x) < length{y)) or 

(length{x) — length(y) and head(x) < head(y)) or 

{length{x) = length{y), head{x) = head{y), and tail{x) Kust tail{y)) 

where length, head, and tail are three functions on lists returning the length of the 
list, its first element, and the list deprived of its first element, respectively. 

The ordering between two complexity pairs is the lexicographic ordering in which 
usual < is used for the integer numbers of the first argument and <iist for the second. 
It is immediate to prove that this ordering is well-founded. 

We show that each non-failing call to AbCl_unif y_actions causes the decreasing 
of the complexity. Well-foundedness of the ordering implies termination. By case 
analysis, we note that: 

rules 1,2, and 7 cannot increase A, while B always decreases 

rule 5 decreases A 

rule 8 is more complicated to analyze, since it calls AbCl_step(f , {< | s} = {t' \ s'}). 
In this case, equations are added on the part of the system dealt as a stack, driving 
the following rule applications. Those sequences of rule applications always allow 
to empty the stack. We consider these operations as a unique step that removes 
{t\s} = {f \s'} and introduces other equations in the system. Four cases must be 
distinguished: 

1. tail(5) and tail(s') are not variables: in this case A cannot increase and B 
decreases, since the equation is replaced by a certain number of equations 
between the elements of the two sets and between their tails, but all of fewer 
(leftmost) size; 

2. exactly one of them is a variable. Assume tail(s) is a variable: a substitution 
for it is computed and applied: A decreases. The situation when tail(s') is 
a variable is perfectly symmetrical. 

3. tail(5) and tail(s') are the different variables X and Y, respectively. One 
of the following cases happens: 

(a) a substitution X = {. . . \ Y} is computed, 

(b) a substitution Y = {. . . \ X} is computed, 

(c) a substitution X = {. . . \ N} and Y = {. . . \ N} {N a new variable, the 
same for the two equations) is computed. 

In all the three cases the application of the substitution cause A to decrease. 

4. tail(s) and tail(s') are the same variable X. In this case one equation 
X = {. . . \ X} is added to £ together with a certain number of equations 
between elements of the two sets {t\s} and {<' | s'}. All those equations have 
(leftmost) size smaller than {t \ s}. 
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To conclude the proof, let us observe that the termination of AbCl_unif yJinal 
is evident. For any variable X occurring in a equation X = {. . . \ X} we perform 
at most one rewriting and application of substitution. X occurs elsewhere in the 
system only as l.h.s. Equations in solved form remains in solved form and do not 
fire any new action. □ 

A. 2 Correspondence between ACI\ with Constants and gflat(g) 

Unification 

Lemma ^ a is a solution of the SUS problem s — t if and only if a* is a ACIl 
unifier of s* = t* . 

Proof 

Without loss of generality, we assume that symbols in s and t are sorted, so as they 
are of the form 

s = {ai, . . . , a„} U {61, . . . , bnj U Yi U ■ ■ ■ U Yp U Wi U ■ ■ ■ U W,^ 

Ci C3 Vi V3 

t = {di, . . . , d,n'} U {bu. . . , bnj U Zi U ■ ■ ■ U Zp, U Wi U ■ ■ ■ U W^^^ 

C2 C3 V2 Va 

where Ci and Vi are determined according to formula (0) — Section The corre- 
sponding (s)* and (t)* are: 

s = fli U ■ ■ ■ U a,n U bi U ■ ■ ■ U 6n U Y"i U ■ ■ ■ U Yp U W^i U ■ ■ ■ U Wg 

Ci C3 Vi V3 

t = di U ■ ■ ■ U dm' U bi U • ■ • U bn ^ ZiU---U Zp'UWiU---U Wq 
C2 C3 V2 V3 

(T is a solution of s = < if and only if 

for each in Ci there is X in V2U V3 such that a-{X) = {ai, ■ ■ ■} and 

for each bj in C2 there is X in U V3 such that a{X) — {bj, ■ ■ ■} and 

each variable in Vi U V2 U V3 is mapped on a set of constants in Ci U C2 U C3 plus, 

possibly, other constants. 

/i is a solution of s ^ t if and only if 

for each Ui in Ci there is X in V2 U V3 such that = and 

for each bj in C2 there is X in V2L) V3 such that ~ bj IJ ■ ■ ■ and 

each variable in ViU V2U V3 is mapped on a union of constants in Ci U C2 U C3 

plus, possibly, other constants. 

Clearly, ^ = cr*. □ 



A. 3 Soundness and Completeness of General ACIl Unification 

Algorithm 

Lemma |5J Given an equation e of the form 

fiCh) U ■ • ■ U /fe, (IfcJ U Li U • ■ • U Lfc, = gi{fi) U ■ ■ ■ U gn, {fh,) U iZi U ■ • ■ U i?^. 
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let {£l,£2, p^), for i = 1, . . . ,k, be the collection of all the distinct solutions non- 
deterministically produced by the call aci_step(e). Then: 

• if a is a unifier of E\ U E\ then a is a unifier of e and a < 

• if a is a unifier of e then there exists 1 < i < k and a substitution 7 such 
that a U 7 is a unifier of £IU £2- 

Proof 

Let us prove the lemma by showing that the conditions liold at each step of the 
construction of eacli solution. 

For the Normalization step, it is trivial to show that cr is a unifier for e if and only 
if tTU7 is a unifier for where 7 possibly binds the new variables N^^, N^^. In this 
case k is equal to 1. The substitution 7 is [N^^/f,{k)a 1 1 < i < ki]U[Nf^/gj lfj)a \ 1 < 
J < h]. 

For the Elementary ACIl Solution step the result follows from the results for 
elementary ACIl unification I Baader and Biittner 1988|l . In this case we have that 
(7 is a unifier of £" if and only if cr U 7 is a unifier for f where dom{-f) C 
{A,^j \1 < i < hi + h2 A 1 < J < ki + k2}. 

Let us consider the Term Propagation step. We prove that cr' = ct U 7 is a unifier 
for £^'^' (where dom{a) n dom{'-f) = and dom{j) = {A^j \1 < i < hi + h2 A I < 
j < ^1 + fe}) iff o'7' is a unifier for £1 U £2 (where 7' is the restriction of 7 to 

{A,^j \hi + l<i<hi+h2Aki + l<]<ki + ^2}). 

Let cr' = cr U 7 be a unifier for £^'^^ and let us consider the equations e in £2'^'^ 
in the same arbitrary order used to build £2- Such equations have the form /(s) = 
[j Ai\j[j Bj. If ct' is a unifier for £'^'^'^ , then (from the ACI properties and Clark's 
Equational Theory) each Aij and Bjj must be either or f{s)a; furthermore, at 
least one of ^i, Bj must be assigned f{s)a. Let I and J be the collection of indices 
for which respectively Ai and Bj receive f{s)a in ct'. We can use / and J to select 
a certain £1 U £2 — the one in which the A is constructed by taking \{Ai) ~ f{s) 
{X{Bj) = f{s)) for i G I {j € J), and for the remaining variables in the equation. 
The process can be repeated for the remaining equations, until all the variables have 
received an assignment in A. The consistency of ct' guarantees that this construction 
will provide a consistent £2 ■ It is straightforward to observe that ct' is a unifier for 
£2- Observe also that ct' < A, i.e., a' — Xo 9. This last fact, together with the fact 
that CT is a unifier for £1'^'^, is sufficient to conclude that ct' is a unifier for £1. 
Vice versa, let ct' be a unifier for a certain £1 U £2 produced by the algorithm. Since 
the construction was possible, then there is a substitution A which has been used 
to convert £2'^^ into £2- If ct' is a solution of the equations s = si, . . . ,s — Sh 
present in £2, then ct' is also a unifier for the equation /(s) = (IJ U|J Bj)X which 
produced such elements of £2- Thus, ct' U [^/^A o a' \ A G dom{X)] is a unifier for 
gACi _ rpj^g j.gsuit foj. gAci jg obvious. 

Correctness for the Variables Removal step follows from the fact that we are not 
interested in solutions over infinite terms. 

□ 
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A. 4 Termination of General ACIl Unification Algorithm 

Definition 11 

Let 5 be a set of equations, and let us consider a function lev : vars{£) — > N. This 
function can be extended over elements of T{Y^, V) as follows: 

lev{f{ta,...,t„)) = 1 + ina:x{lev{to), . . . Jev(tn)} / £ S, / ^ U 
lev{s U t) — max{lev{s),lev{t)} 

The function lev is said to be a partial p-level if it satisfies the condition: 

(*) lev{t}, lev{r) < p, for any equation I — r \n £. 

Any partial p-level lev is said to be a (complete) p-level if it satisfies also the 
condition: 

lev(£) — lev{r) for any equation £ = r in £. 
Lemma 3 

Let us consider a system of equations £, and let p be the number of occurrences of 
elements of S in f ; then, exactly one of the following conditions holds: 

• there exists a complete p-level for £ 

• for any natural number g, there are no complete g-levels for £. 
Proof 

Given the system £, it is possible to obtain, by adding a suitable number of new 
variables, an equivalent system £' in Rat form, i.e., each equation in £ is in one of 
the following forms: 

1. X = Y 

2. X=/(yi,...,y„),/eEand/^U 

3. X ^ Y1UY2 

Observe that at most p equations of type Q can appear in £' . 

The goal is to map £' to a set of linear integer constraint systems. Each possible 
complete p-level for £' (and thus for £) is a solution of at least one of such systems of 
constraints. Vice versa, each solution of one of these systems can be used to generate 
a complete g-level for £' , for a suitable q. Such mapping is realized as follows: for 
each (term) variable X in £' we introduce a corresponding (integer) variable x] 
then we add equations and disequations according to the following rules: 

X = X if X does not occur elsewhere, then add a; = 

X = Y X = y 

X = /( Fi, . . . , r„), 7^ > ^ YLii^ ^v^ + lA a;=i,,^, %■ < y^) 

X = a X ~ 1 

X = Y1UY2 ^ (a; = 2/1 A 1/2 < 2/1) V (x = ?/2 A 2/1 < 2/2) 

Through simplifications (e.g., distributivity) it is possible to obtain a disjunction 
of systems SiV ■ ■ ■ V Sk, where each system Si contains only equations of the form: 

X = y X — y + 1 X = x — 1 x < y 
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y2 = 

V2 < X 

V2 < yi 
X = yi + 1 
yi = 2/2 + 1 
2/3 < 2; 

Fig. Al. A simplified integer system, the corresponding graph, and a solution 

Furthermore, in each system there can be at most p occurrences of equations of 
the type x = y + 1 and x = 1. Our aim is to show that, if one of the systems Si is 
satisfiable, then there will be one solution a of Si such that for each variable x we 
have a{x) < p. 

Each system Si can be further simplified using the following observations: 

All equations of the form x — I can be eliminated and replaced with the equations 
w = and x = w + 1, where w is a new variable. Note that the total number of 
equations x ^ y + I is still at most p even after this simplification. 
The equations of the form x ~ y induce an equivalence relation on the variables. 
We can remove all these equations and replace each occurrence of each variable in 
Si with a selected representative from its equivalence class. 

For each Si we can construct a labeled graph — {vars{Si), Ei) as follows (see 
example in Figure IXTl thick lines are used for +1 edges and thin lines for edges): 

for each equation of the form x = y + \ in Si^ generate an edge (y, x) with label +1 
for each equation of the form x < y 'm Si, generate an edge {x,y) with label 0, 
unless there is already an edge {x, y) with label +1. 

If contains a cycle with at least one edge labeled +1, then Si will not admit 
solutions. Cycles in Gs^ composed only of edges of type denote an implicit equality 
between the nodes in the cycle — thus we can collapse the cyclic component. These 
observations allow us to focus only on directed acyclic graphs. 

A solution of Si can be described as a labeling of the nodes of the graph. A 
consistent labeling <t of the nodes of the graph representing a solution should fulfill 
the following conditions: 

if (x, y) is an edge of type +1, then (t(j/) = (7{x) + 1 
if {x, y) is an edge of type 0, then a{x) < (7{y) 
if there is an equation a; = in 5^, then a{x) — 0. 

We claim that if the Gs^ admits a labeling with the above properties, then Gg., 
also admits a labeling (j) of the nodes such that for each node X we have (piX) < r, 
where r is the number of +1 edges in Gs^ — in particular r < p. 

Let us develop a proof by lexicographical induction over the measure {A,B), 
where A is the number of +1 edges and B is the number of edges in the graph. 

(0, 0) In this case the graph is composed only of disconnected nodes, and the origi- 
nal system Si contains only equations of the form 2; = 0; the solution a such that 
a{x) = for each node a; is a consistent 0-labeling. 
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(m, n) Let x be an arbitrary node of Gs, with no outcoming edges, (wi, a;), . . . , {vh, x) 
incoming edges of type +1, and {wi, x), . . . ,{wk,x) incoming edges of type 0. With 
no loss of generaUty we assume h + k > 1. Let us distinguish the foUowing cases: 

1. h = 0: consider the graph Gg^ obtained by removing node x and aU its 
incoming edges (ah of type 0). The measure for the graph Gg is (m, n — 
k). By inductive hypothesis, there is a consistent m-labehng a of G'g.. a 
can be extended to a consistent m-labehng of Gg,^ by assigning aix) = 
max{(T(i(;i), . . . ,cr(wfc)}. 

2. h > 1 and fc > 0: in each consistent labeling of Gsi we must have that 
a(vi) = • • ■ = a{vh) — o-{x) — 1. Let us consider the graph G'g. obtained 
by collapsing nodes vi, . . . ,Vfi into a single node v. The measure of G'g. is 
(m — /i + 1, n); thus, by inductive hypothesis, it is possible to determine a 
consistent (m — fc + l)-labeling a of Gg. . a can be extended into a consistent 
(m — fc + l)-labeling of Gs^ by defining a(y\) = • • • = aivh) = ""(w). By 
definition u is also a consistent m-labeling of the graph. 

3. /c = and h = \: consider the graph G'g. obtained by removing X and 
its incoming edge. The measure of G'g. is (m — thus, by inductive 
hypothesis, there is a consistent (m — l)-labeling a of such graph. This 
labeling can be extended to a consistent 771-labeling of Gs^ by defining 
a{x) — cr(wi) + 1. 

4. A; > and /i = 1: in each consistent labeling of Gg^ we must have that: 

(a) a{y\) = o[x) — 1 

(b) a{wi) < a{x) for i = 1, . . . , k, thus a(wi) — cr{x) or a{wi) < a{vi) 

Let us consider the following class of simplified graphs: we arbitrarily par- 
tition {wi, . . . , Wk} into two subsets Bi, B2 and we consider the graph ob- 
tained by: 

• removing all edges (wi, x) 

• collapsing all nodes in i?i U {2;} 

• adding the edges {wi, vi) for each Wi G B2 

• if i?i = 0, then the node x and the edge (wi, x) are removed. 

The two properties (|4a(l and (|4b|) guarantee that each consistent labeling 
of Gsi is a consistent labeling of at least one of the simplified graphs, and 
each consistent labeling of a simplified graph can be extended (see below) 
to a consistent labeling of Gsi ■ Since we are under the assumption that 
Gsi admits consistent labelings, at least one of the simplified graph admits 
consistent labelings. The measure of each simplified graph is (m, n — \Bi\) 
if Bi ^ 0, {m — otherwise. By inductive hypothesis we can build 

a consistent m-labeling (or (m — l)-labeling in the last case) a for such 
graph. If i?i 7^ 0, then a can be extended to a consistent m-labeling of 
Gsi by defining a{wi) — a{x) for each Wi £ Bi. Otherwise, a consistent 
m-labeling of Gsi is obtained by defining a{x) — a{vi) + 1. 



□ 
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The notion of p-level has a direct interpretation on a graph-encoding of the 
system of equations. The unification algorithm itself can be mapped on a collection 
of graph manipulation operations. The mapping of the unification algorithm on 
graphs allows us to intuitively demonstrate that for each intermediate system of 
equations during the unification process it is possible to determine a partial p-level 
(where p is the number of occurrences of elements of S in the initial system). 

Given the initial system Sq we define the directed labeled graph Go as follows: 

• Go contains a node for each occurrence of a function symbol in £q: without 
loss of generality, we assume that each occurrence of a constant c has been 
replaced with a term c{B), where -B is a fixed variable, and c is a new unary 
function symbol. 

• Go contains a node for each variable in Sq. 

• For each term /(ii, . . . , i„) (/ difi'erent from U) in Sq, ii ij, is the node created 
for the specific occurrence of/, and Vi is the node created for the main functor 
of ti (or for the variable ti), then the edge {^ijVi) with label +1 is added to 
Go 

• let t be a term U • • • U i„ such that: n > 1, the main functor of each ti is 
different from U, and either 

— the term t is the left-hand side or the right-hand side of an equation in 
So; or 

— there exists a term f{t\, . . . ,tn) in f o such that t = U and / is different 
from U. 

Let ^, be the node introduced for the first occurrence of U in i.e., 

ti ^ <2 U • • • U t„ 

t 

and let Vi be the node created for the main functor of ti (or for the variable 
ti); the graph Gq contains the edges {fx, v^) with label for i = 1, . . . , n. 

• remove from Go all the nodes created for occurrences of U which do not have 
any outgoing edges. 

Example 9 

Let ^o be the system f{X)\Jf{g{Y)) = f{g{Z) U h{Z , F)) U F U W. Then Go is 
the graph (thick lines are used for +1 edges while thin lines are used for edges): 



U 
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Let us define an iteration to be a single appfication of a rule of the procedure 
general_aci. Each rule of the unification algorithm can be mapped onto an oper- 
ation on the graph. If £i is the system obtained after i iterations of the unification 

algorithm, then we denote with the corresponding graph. The graph operations 
corresponding to the different non-failing unification rules are the following: 

1. if £i+i is obtained by removing an equation X = X from £i, then Gi+i = Gi 

2. if fj-i-i is obtained by replacing t = X with X = t in £i, then G^+i = Gi 

4. if f j+i is obtained by replacing each occurrence of X with i in fj, then G^+i is 
obtained by adding the edge {fi, u) with label 0, where ji is the node associated 
with the variable X and v is the node created for the main functor of term t 
(or for the variable t) 

5. if fj+i is obtained by replacing the equation X = t where i = /i(si) U • • • U 

U Vi U • • • U Vm U X (assumed in this ordered form as explained in the 
unification algorithm note that this simplification is not needed in the graph 
representation) with the equation X = /i (si) U • • • U/„ (s„) U Fi U • • • U Vm UN, 
N new variable, then Gi+i is obtained by adding a new node u for N, by 
removing the edge (/x, ^) where /i is the node for the functor of t and ^ the 
node for X , and by adding the new edge (/i, i^) 

7. if fi+i is obtained by replacing the equation f{ti, . . . ,t„) = /(si, . . . , s„) in 
£i, then Gj+i = 

8. let us assume that £i+i is obtained by replacing the equation 

/l(si) U • • -UfniSm) UXiU---UXh = ffl(il) U • • • U gn{tn) U Fi U • • • U Ffc 

in £i with a family of equations: 

fi{si) = gj{tj) for some i,j 
and by substituting Xi (Y^) with terms of the form: 

Xi = giSh) U • • • U U #1 U • • • U AT, 

(similarly for Yj). Gj+i is obtained from Gj as follows: 

— introducing a new node Vi for each new variable Ni 

— if is the node for Xi [Yi) and r)j is the node for the main functor of 
term gj(ij) (fjisj)), then add the edge and {p',rij) with label 
for each gj{tj) {fj{sj)) and for each present in the substitution for 
Xi (Yi). 

Lemma 4 

Let One{G) be the set of +1 edges present in the graph G. Then for each Gj ob- 
tained from the above transformations we have One{Gi) = One (Go). Furthermore, 
Gi does not contain any cycles which include edges labeled +1. 
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Proof 

The first property is obvious from the definition of the transformations. 

The second property is straightforward for the cases (1), (2), and (7) of the 
unification algorithm, since they do not add edges — and thus cannot generate cycles. 
Case (5) adds a new edge, but the destination of the edge is a new variable which 
has no outgoing edges. 

Case (4) can be seen as follows: let us assume, by contradiction, that the addition 
of the edge from the node of X to the node of t generates a cycle with +1 edges. 
This means that, before this operation, there exists a path from the root of t to 
the node of X (with at least one +1 edge). This path can be only the result of 
a sequence of edge additions leading from a node reachable from the root of t to 
the node of X. Each of these edges has been introduced during previous variable 
substitutions — and each of the nodes reachable using this path identifies a sub-term 
of t. Thus, X is a sub-term of t. This contradicts the possibility of applying case 
(4), since this situation is explicitly handled by case (3) and leads to a failure. 

Case (8) can be seen as a combination of cases (7) (new equations of the type 
/i(si) = 9j(ij) which do not modify the graph), (5) for the new variables Ni, and 
(4) for the substitution of existing variables. □ 

Lemma 5 

Let us assume that there is a non-failing sequence of k non-deterministic choices, 
such that general_aci(f ) generates (one per each successive iteration) the systems 
£ = £^°\£''^\£^^\ . . . , £'^''\ Let p be the number of occurrences of function symbols 
in £^^\ Then, there exists 

lev.vars '^^^^ j — 

such that: 

• it fulfills condition (*) of Dcf.^Jfor all systems of equations f (i.e., lev{i) < 
p and lev{r) < p for all the equations £ ^ r m £'-^-'), and 

• any time a substitution [X/i] has been applied, then lev{X) — lev{t). 

Proof 

Let us consider the graphs Gj associated to the systems £^^\ First of all, observe 
that if there is a function fulfilling the requirements for the system £^J\ then the 
same function works for all graphs i?^'-* with i < j. This allows us to concentrate 
on £^''\ By Lemma01 we know that Gk is acyclic and it contains the same number 
(p) of -1-1 edges as £^^\ From this fact, starting from leaf nodes and going back on 
edges, augmenting a value only if a -f 1 edge is encountered, it is natural to find a 
function lev fulfilling the required property. □ 

Theorem 3 [termination) 

Given a system of equations f , all the non-deterministic branches of the computa- 
tion of general_aci(f ) terminate in a finite amount of time. 
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Proof 

Assume that there is a non-faiUng sequence of non-deterministic choices £^-^\£^^\ 
£^'^\ . . . , E'^^^ (they are the values of £ at the 0*'', 1**, 2"'', . . . , A;*'' iteration, respec- 
tively), and let p be the number of occurrences of function symbols in f^"-*. We 
know from LemmaElthat there exists a function lev : vars ^IJj>q f '--'■'^ — > N such 
that 

it fulfills condition (*) for all the systems of equations £^^\ and 

any time a substitution [X/t] has been applied, then lev(X) = lev{t), 

We call this property condition (a). 

Picking such a lev, we define a measure of complexity Cs for the system of 
equations £: 

4'™) = [#(2p),#(2p-l),#(2p-2),...,#(l),#(0)] 
where returns the number of equations not in solved form £ — r m £ such 

that lev{£) + lev{r) — j. The ordering between two lists of this form is the usual 
well-founded lexicographical ordering. 

Let h be the number of equations in the initial system. The initial tuple 
necessarily less than or equal to [h,0, . . . ,0]. Let us consider how the various rules 
in Figure El modify the complexity measure tuple: 

rule (1) clearly reduces the complexity by removing one equation 
rule (2) does not affect the complexity but can be safely ignored (we could easily 
rewrite the algorithm without it by adding explicit cases for equations t = X 
wherever we analyze X = t) 

rule (4) reduces the complexity: in fact one equation of complexity 2lev{X) is 
removed, while the rest of the system is unaffected, since X is replaced by a term 
with the same level 

rule (5) will lead in one additional iteration to an rule (4), which means that the 
complexity of the original equation must be 2lev{X); by assigning lev{N) = lev{X) 
we have that after two reductions the complexity will decrease 
rule (7) replaces an equation of complexity 2 + li + ri with a collection of equations 
each having complexity l + r < /i + ri , leading to a smaller total complexity (thanks 
to lexicographical ordering) 

rule (8) is a complex rule which leads to the execution of the aci_step function. 
Let e be the equation communicated to aci_step. The only equations in non-solved 
form that are generated by aci_step are the equations s = s' present in £^ns- Such 
an equation s — s' originates from simplifying an equation /(s) — /(s') U • • •. 
Observe that in this equation f{s) and /(s') originally appeared on distinct sides of 
the equation e — in the general structure of the equation, one of the two is a fj{lj) 
and the other is a gi{fi). Thus, the equation /(s) — f{s') has a complexity which 
is less or equal than that of e, which implies also that the complexity of s = s' is 
strictly lower than that of e. Thus the original equation is replaced by a collection 
of equations of smaller complexity (assuming, as stated earlier, that the equations 
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of the form Li = ti and Rj = Sj ■ ■ ■ that lead to p are all such that lev(Li) = lev{ti) 
and lev{Rj) = lev{sj)). 

Thus, every rule application decreases the complexity measure The lexico- 

graphical ordering on constant-length lists of non- negative integers is a well-founded 
ordering, and thus this activity cannot be done indefinitely. 

However, this is not sufficient for termination, since we are not sure that the 
complexity measure tuple reaches the value [0, . . . , 0] within k rule applications. 
Moreover, we do not know if the function lev fulfills condition (a) for the successive 
systems £ik+i) ^ £{k+2) ^ _ ^ 

To prove termination, a further measure is needed: let 

Ms — ^C^p : £ is a. function from vars{£) to {0, ■ . ■ ,p} that fulfills condition (a)]} 

Multisets of tuples are governed by (well-founded) multiset ordering. 

AA^io) is finite. All the initial tuples are less than or equal to [r, 0, . . . , 0]; each of 
them is associated to a function from vars to {0,...,p} that fulfills condition 
(a). 

Let us consider this multiset and the effects of an iteration over each of its tuples. 
After one iteration it holds that: 

• The function £ fulfills condition (a) for the successive systems. In this case t is 
replaced by a fewer tuple (see the proof above). 

• The function £ does not assign values to new variables. However, it is possible to 
extend £ into £' in order to assign values for these variables. In this case the tuple 
t is replaced by a certain (finite) number of tuples fewer than t (the new variables 
TV are introduced in equations of the form X — ■ ■ ■ (J N and thus, £'{N) < £{X)). 

• The function £ does not fulfill condition (a) for the new system and, moreover, it is 
not possible to extend £ into £' in order to assign values for these variables to fulfill 
condition (a). In this case the tuple t is simply removed from the multiset. 

Since multiset ordering is well-founded, this ensures termination. □ 



Appendix B Matrix for Term Propagation 

In this section we briefly show how it is possible to compute automatically the 
output equations of the Term Propagation phase of the General ACI unification 
algorithm (Section |^2J- The method we propose builds on the solution of the ACI 
unification with constants problem based on ^C/-matrices; the novelty is the use of 
a simplified form of j4C7-matrix that takes advantage of the format of the equations 
to be dealt with in this context — i.e., elementary ACIl equations. 
Given an elementary ACIl unification problem 

U ■ • ■ U A"™ U Xi U • ■ • U Xp = Ti U ■ • ■ U T„ U Xi U • ■ • U Xp 
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the elementary ACI -matrix is as follows: 



Si 


Sn 




Xp 






^l,n+l 




Am,l 


A 










^m + l,n + l 




Afn+p^l 









T 



However, variables Am+i,n+j with j > 0, j > 0, i 7^ j are not used and thus we can 
avoid to introduce them. The most general unifier for the elementary problem can 
be obtained as follows: 



u 

i = l 



m+i,k ^7n+j ,n+j 

fc=l 



j = l 



One can easily prove that this method provides the same solution as the ACI unifi- 
cation with constants algorithm based on Boolean ACI matrices of (jBaader and Biittner 1988|l . 
briefly recalled in Sectional 



Example 10 

Let us consider the same unification problem S'l U ^2 U X 
Example 13 the elementary j4C/- matrix is 



Ti U T2 U X as in 



^1 


S2 


X 




Ri 


R3 


R7 


Ti 


R2 


R4 


i?8 




R5 


Re 


Rg 


X 



Let us observe that the variables in the matrix have been named to show the 
correspondence with the new variables used in Example 13 

Given the unification problem: 

= N,^ ^ Ml) /\ ■ ■ ■ ^ ^ fkd\) ^ 
= <?i(ri) A • • • A 7V^^ = ghArh,) A 
N^^ U ■ ■ ■ U N^^U LiU ■ ■ ■ U Lk^ ^ Nf U ■ ■ ■ U N^^L) RiU ■ ■ ■ U N^^ 
we solve the elementary ACIl problem on the equation: 

iV/- U • • • U iVfc^ U Li U • • • U Lfc2 = A^i^ U • • • U A^;^ U i?i U • • • U 

We build an auxiliary Boolean matrix B that allows us to reduce the non-determinism. 
We deal with two cases: 
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• If {Li, . . . , ifcj} n {i?i, . . . , Rh2} — (non-deterministic) solution can be 
described using a (/ii + /12) x (fci + ^2) matrix B such that 

— for /ii + 1 < i < /ii + /i2 and /ci + 1 < j < fci + A;2 we have B[i,i] = _L 

— all the other components of B have a value taken from {0, 1} 

— for each 1 < « < /ii X^^lt*^^ ^[^j] ^ 1 and for each 1 < j < fci 

E-il'^5[z,j]>i. 

Thus, i? is a boolean matrix with the exception of the fourth quadrant, where 
the matrix contains only the value _L. The matrix B can be used to describe 
the substitution A: 



A,,, ifB[z,j]=± 

ifB[i,j]=0 

h{n) if - 1 Aj > /ii 

h{l,) if - lAj < /ii 

Additionally, B generates the new set of equations: 

B[i,j\ = l/\l<i<ki/\l<j<hi 

• Assume now that the two sides of the equation share some variables. I.e., let 
us assume that the problem at hand is 

iV/' U • • • U iV^f; U Li U • • • U Lfca U Comi U • • • U Conic = 

A^/^ U • • • U A^,^ U i?i U • • • U Rh^ U Comi U • • • U Come 

The solution of the problem in this case can be built around the elementary 
^C7-matrix shown in Figure lBTl The table in Figure IbH assumes h — hi + h2 
and k = ^1 + ^2. The solution of the ACI problem, in this case, will be 
composed of equations of the form: 





hi 


hi+h2 


hi+h2 + c 








u U ^^-ti+j 




i=l 


i=hi+l 


i=hi+h2 + l 




ki 


ki + k2 


ki+k2 + c 


R, 


= U Ahi+i,] 


U U Ah,+^.j 


U U ^/u+.j 






]=ki+i 


J=fcl+fc2 + l 




/ll+/l2 + C 


ki + k2-\ 


-c 


Comy 


= u 


,ki + k2 + v U [J 






1=1 


J = l 





In Figure lB2l we depict the boolean matrix B which will be used in this case. 
The matrix B should satisfy the following properties: 

— quadrant 5, 6, and 8 are filled with _L; 

— the non-zero entries in quadrant 9 are assigned _L; observe that the 
quadrant 9 is a diagonal matrix with non-zero elements only along the 
main diagonal; 

— quadrant 1, 2, 3, 4, and 7 are boolean matrices; 

— for 1 < J < h we have E tt"' B[i,j] + E -=1 B[i,j] > 1 
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— for 1 < z < /ii we have E'lt"' + ^,=1 B[i,j] > 1 

The substitution A and the collection of new equations £;conf ^j.g dgfijigfj 
exactly as above. 



Li 



Comi 



1 ... 1 


Ai,ki+i 1 • ■ • 1 


A\^k+i 1 ■ ■ ■ 1 ^i,fc+c 












Ah-iM 


Ahi,k-i+i 




Ahi,k 


Ahi,k+1 




Ah-i,k + c 






Ah-i+iM 


^4i+i,fei+i 




Aht+\,k 


Ah-i + l,k+l 




Ahi+l,k+c 








Ah,i 




Ah,k-i 


Ah,ki+i 




Ah,k 


Ah,k + 1 




Ah,k + c 






Ah+iM 


Ah+1M+'^ 




Ah+l,k 


Ah+l,k+l 




Ah+l,k+c 








Ah+c,i 1 ■ • • 1 Ah+cM 


^fe+c./ci+l 1 ■•■ 1 ^/l+C,fc 


Ah+c,k+l 1 ••• 1 ^h+c,fc+c 



Conic 



7V« 

hi 
ill 



Rh2 

Comi 



Conic 



Fig. B 1. Elementary ^C/-matrix 



Li 



Comi 



Conic 











Quad 1 


Quad 2 


Quad 3 


K 








Ri 


Quad 4 


Quad 5 


Quad 6 


Rh2 






Quad 9 


Comi 


Quad 7 


Quad 8 




Come 



Fig. B 2. Extended Boolean Matrix B 



